Tetrodotoxin-Sensitive Sodium Channel α-Subunit

The present invention relates generally to sodium channel proteins and more
particularly to a novel cloned α-subunit of a voltage-gated, tetrodotoxin-sensitive sodium
channel protein. The present invention further relates to its production by recombinant
technology and to nucleic acid sequences encoding this protein.

BACKGROUND OF THE INVENTION 
The basic unit of information transmitted from one part of the nervous system to
another is a single action potential or nerve impulse. The "transmission line" for these
impulses is the axon, or nerve fiber. The electrical excitability of the nerve membrane has
been shown to depend on the membrane's voltage-sensitive ionic permeability system that
allows it to use energy stored in ionic concentration gradients. Electrical activity of the
nerve is triggered by a depolarization of the membrane, which opens channels through the
membrane that are highly selective for sodium ions, which are then driven inward by the
electrochemical gradient. Of the many ionic channels, the voltage-gated or voltage-sensitive
sodium channel is one of the most studied. It is a transmembrane protein that is essential
for the generation of action potentials in excitable cells. An excellent review of sodium
channels is presented in Catterall, TINS 16(12), 500-506 (1993).

The cDNAs for several Na+ channels have been cloned and sequenced. Numa et al.,
Annals of the New York Academy of Sciences 479, 338-355 (1986), describe cDNA from
the electric organ of eel and two different ones from rat brain. Rogart, U.S. Patent No.
5,380,836, describes cDNA from rat cardiac tissue. See also Cribbs et al., Proc. Natl. Acad.
Sci. 86, 8170-8174 (1989). A peripheral nerve sodium channel, referred to as PN1, has
been detected based on sodium current studies and hybridization to a highly conserved
sodium channel probe by D'Arcangelo et al., J. Cell Biol. 122, 915-921 (1993). However,
neither the DNA nor the protein was isolated, and the complete nucleic acid and amino acid
sequences were not identified. A partial amino acid sequence was presented at the 23rd
Annual Meeting of the Society for Neuroscience, November 7-12, 1993, Washington D.C.,
see Abstracts: Volume 19, Part 1: Abstract 121.7: "Nerve Growth Factor Treatment of
PC12 Cells Induces the Expression of a Novel Sodium Channel Gene, Peripheral Nerve
Type 1 (PN1)," by B.L. Moss, J. Toledo-Aral and G. Mandel.

A sodium channel gene abundantly expressed in neurons and glia, referred to as
NaCh6, was detected in 1995 based on RNase protection assays, in situ hybridization and
RT-PCR hybridization. See Schaller et al., J. Neuroscience 15(5), 3231-3242 (1995).

These studies have shown that the amino acid sequence of the Na+ channel has been
conserved over a long evolutionary period. These studies have also revealed that the
channel is a single polypeptide containing four internal repeats, or homologous domains
(domains I-IV), having similar amino acid sequences. Each domain folds into six predicted
transmembrane α-helices or segments: five are hydrophobic segments and one is highly
charged with many lysine and arginine residues. This highly charged segment is the fourth
transmembrane segment in each domain (the S4 segment) and is likely to be involved in
voltage-gating. The positively charged side chains on the S4 segment are likely to be paired
with the negatively charged side chains on the other five segments such that membrane
depolarization could shift the position of one helix relative to the other, thereby opening
the channel. Accessory subunits may modify the function of the channel.

Therapeutic utility has been discovered in recombinant materials derived from the DNA of
the numerous sodium channels. For example, U.S. Patent No. 5,132,296 discloses
purified Na+ channels that have proven useful as therapeutic and diagnostic tools.

Isoforms of sodium channels are divided into "subfamilies". The term "isoform" is
used to mean distinct but closely related sodium channel proteins, i.e., those having an
amino acid homology of approximately 60-80%. These also show strong homology in
functions. The term "subfamilies" is used to mean distinct sodium channels that have an
amino acid homology of approximately 80-95%. Combinations of several factors are used
to determine the distinctions within a subfamily, for example, the speed of a channel,
chromosomal location, expression data, homology to other channels within a species and
homology to a channel of the same subfamily across species. Another consideration is an
affinity to tetrodotoxin ("TTX"). TTX is a highly potent toxin from the puffer or fugu
fish which blocks the conduction of nerve impulses along axons and in excitable membranes
of nerve fibers. TTX binds to the Na+ channel and blocks the flow of sodium ions.

Studies using TTX as a probe have shed much light on the mechanism and structure
of Na+ channels. There are three Na+ channel subtypes that are defined by the affinity for
TTX, which can be measured by the IC50 values: TTX-sensitive Na+ channels (IC50 =
1 nM), TTX-insensitive Na+ channels (IC50 = 1-5 µM), and TTX-resistant Na+ channels
(IC50 > 100 µM).
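
The three affinity classes can be sketched as a small lookup. This is a minimal illustration, not a method from the specification: the function name, the 10 nM ceiling used for "TTX-sensitive", and the decision to count exactly 100 µM as resistant are all assumptions, and any value falling between the quoted ranges is reported as unclassified rather than guessed.

```python
# Illustrative sketch only. The IC50 ranges come from the text above; the
# cutoffs between them are assumptions and are marked as such.
NANOMOLAR = 1e-9
MICROMOLAR = 1e-6


def classify_ttx_affinity(ic50_molar: float) -> str:
    """Map a TTX IC50 (in mol/L) onto the three subtypes named in the text."""
    if ic50_molar <= 0:
        raise ValueError("IC50 must be positive")
    if ic50_molar >= 100 * MICROMOLAR:
        # Text: IC50 > 100 uM. Including exactly 100 uM is an assumption.
        return "TTX-resistant"
    if 1 * MICROMOLAR <= ic50_molar <= 5 * MICROMOLAR:
        # Text: IC50 = 1-5 uM.
        return "TTX-insensitive"
    if ic50_molar <= 10 * NANOMOLAR:
        # Text: IC50 = 1 nM. The 10 nM ceiling is an assumed tolerance.
        return "TTX-sensitive"
    # The text does not define the gaps between the quoted ranges.
    return "unclassified"


if __name__ == "__main__":
    for value in (0.3 * NANOMOLAR, 2 * MICROMOLAR, 100 * MICROMOLAR, 50 * NANOMOLAR):
        print(f"{value:.1e} M -> {classify_ttx_affinity(value)}")
```

Returning "unclassified" for the gaps keeps the sketch from implying boundaries that the text does not state.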

TTX-insensitive action potentials were first studied in rat skeletal muscle (Redfern
et al., Acta Physiol. Scand. 82, 70-78 (1971)). Subsequently, these action potentials were
described in other mammalian tissues, including newborn mammalian skeletal muscle,
mammalian cardiac muscle, mouse dorsal root ganglion cells in vitro and in culture, cultured
mammalian skeletal muscle and L6 cells (Rogart, Ann. Rev. Physiol. 43, 711-725 (1980)).

Dorsal root ganglia neurons possess both TTX-sensitive (IC50 ≈ 0.3 nM) and
TTX-resistant (IC50 ≈ 100 µM) sodium channel currents, as described in Roy et al., J. Neurosci.
12, 2104-2111 (1992).

TTX-resistant sodium currents have also been measured in rat nodose and petrosal
ganglia (Ikeda et al., J. Neurophysiol. 55, 527-539 (1986) and Stea et al., Neurosci. 47,
727-736 (1992)).



DESCRIPTION OF THE INVENTION 

The present invention relates to novel sodium channel proteins. Specific 
embodiments include the α-subunit of such sodium channels that are TTX-sensitive.



In particular, the present invention relates to a purified and isolated DNA sequence
encoding a novel rat TTX-sensitive sodium channel protein and a splice variant thereof.
The term "purified and isolated DNA" refers to DNA that is essentially free, i.e. contains
less than about 30%, preferably less than about 10%, and even more preferably less than
about 1% of the DNA with which the DNA of interest is naturally associated. Techniques
for assessing purity are well known to the art and include, for example, restriction mapping,
agarose gel electrophoresis, and CsCl gradient centrifugation. The term "DNA" is meant to
include cDNA made by reverse transcription of mRNA or by chemical synthesis.

Specifically, the invention encompasses DNA having the engineered versions
(discussed in detail below) of the nucleotide sequences set forth in SEQ ID NOS:1 and 2,
designated herein as nerve sodium channel type 4 and the splice variant 4a (PN4 and
PN4a). These versions of the PN4 and PN4a sequences were produced by removing most
of the untranslated sequences of the PN4 and PN4a cDNAs and were cloned into expression
vectors for functional analysis. The longer "native" version of PN4 is shown in Fig. 3
(SEQ ID NO:7). The complete "native" base pair sequence of PN4a is the same sequence
shown in Fig. 3 and labeled rPN4, with the 30 base pair insert after position 2050. The
PN4 and PN4a DNA sequences comprise cDNA sequences that encode the α-subunit of
novel voltage-gated, TTX-sensitive sodium channels, specifically the amino acid sequences
set forth in Figs. 1 and 2 (SEQ ID NOS: 3 and 4). DNA sequences encoding the same or
allelic variant or analog sodium channel protein polypeptides of the nervous system,
through use of, at least in part, degenerate codons are also contemplated by this invention.
The nucleotide sequences of SEQ ID NOS:1 and 2 correspond to the cDNAs from rat. A
homology search showed that the closest related sodium channel is found in the fugu
(puffer fish), with 92% homology. The next closest channels are rat brain types I and II, at
87.9%, and rat brain type III, at 87.3%. Homology to all other known channels drops off
significantly thereafter.
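
As a rough illustration, the quoted homology figures can be ranked and compared against the approximate 60-80% and 80-95% bands defined in the Background. The dictionary keys, the function name and the treatment of exactly 80% are assumptions made for this sketch; the specification does not say how boundary values are handled, nor that these bands were applied to the search results.

```python
# Illustrative sketch only: percentages are the ones quoted in the text;
# the band boundaries' inclusivity is an assumption.
HOMOLOGY_TO_PN4 = {
    "fugu channel": 92.0,
    "rat brain type I": 87.9,
    "rat brain type II": 87.9,
    "rat brain type III": 87.3,
}


def homology_band(percent: float) -> str:
    """Place a percent homology into the approximate bands given in the Background."""
    if 80.0 <= percent <= 95.0:
        return "80-95% band"
    if 60.0 <= percent < 80.0:
        return "60-80% band"
    return "outside the quoted bands"


# Rank the channels from most to least homologous.
ranked = sorted(HOMOLOGY_TO_PN4.items(), key=lambda item: item[1], reverse=True)

if __name__ == "__main__":
    for name, percent in ranked:
        print(f"{name}: {percent}% ({homology_band(percent)})")
```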

Additionally, it is believed that the novel voltage-gated, TTX-sensitive sodium
channel is also expressed in tissue of other mammalian species such as humans, and that the
corresponding gene is highly homologous to the rat sequence. Therefore, the invention
includes cDNA encoding a novel mammalian voltage-gated, TTX-sensitive sodium channel.

The invention not only includes the entire protein expressed by the cDNA
sequences of SEQ ID NOS:1 and 2, but also includes protein fragments. These fragments
can be obtained by cleaving the full length proteins or by using smaller DNA sequences or
polynucleotides to express the desired fragment. Accordingly, the invention also includes
polynucleotides that can be used to make polypeptides of about 10 to 1500, preferably 10
to 100, amino acids in length. The isolation and purification of such recombinant
polypeptides can be accomplished by techniques that are well known in the art, for
example preparative chromatographic separations or affinity chromatography. In addition,
polypeptides can also be made by synthetic means which are well known in the art.

In general, sodium channels comprise an α-subunit and two β-subunits. The β-subunits
may modulate the function of the channel. However, since the α-subunit is all that is
required for the channel to be fully functional, expression of the cDNA in SEQ ID NOS:1
and 2 will each provide a fully functional protein. The gene encoding the β1-subunit in
nerve tissue was found to be identical to that found in rat heart, brain and skeletal muscle.
The cDNA of the β1-subunit is not described herein as it is well known in the art (Isom et
al., Neuron 12, 1183-1194 (1994)). However, it is to be understood that by combining the
known sequence for the β1-subunit with the α-subunit sequence described herein, one may
obtain complete PN4 and PN4a rat voltage-gated, TTX-sensitive sodium channels.

Northern blot analysis indicates that PN4 and PN4a are each encoded by a
~7.5 kb/9.5 kb transcript. The nucleotide sequence analysis of the PN4 cDNA identifies a
5934-base open reading frame, shown in SEQ ID NO:1, starting at base 22. The nucleotide
sequence analysis of the PN4a cDNA identifies a 5964-base open reading frame, shown in
SEQ ID NO:2, also starting at base 22. The deduced amino acid sequence of PN4, shown
in Fig. 1 (SEQ ID NO:3), exhibits the primary structural features of an α-subunit of a
voltage-gated, TTX-sensitive sodium channel. Shown in Fig. 1 are the homologous domains
(I-IV); the putative transmembrane segments (S1-S6); the amino acid conferring sensitivity
to TTX (▲); potential cAMP-phosphorylation site (●); and potential N-glycosylation site
(♦). The deduced amino acid sequence of PN4a, shown in Fig. 2 (SEQ ID NO:4), also
exhibits the primary structural features of an α-subunit of a voltage-gated, TTX-sensitive
sodium channel. Shown in Fig. 2 are the homologous domains (I-IV); the putative
transmembrane segments (S1-S6); the amino acid conferring sensitivity to TTX (▲);
potential cAMP-phosphorylation site (●); and potential N-glycosylation site (♦).
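
The open reading frame figures quoted above can be checked with simple arithmetic. The sketch below is illustrative: the function name is hypothetical, positions are assumed to be 1-based and inclusive, and the text does not state whether the quoted ORF lengths count the stop codon, so the codon totals are reported without converting them to a residue count.

```python
# Illustrative sketch only: start base and ORF lengths are taken from the text;
# 1-based inclusive numbering is an assumption.
def orf_summary(start_base: int, orf_length: int) -> dict:
    """Return the last base and the codon count of an open reading frame."""
    if orf_length % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    return {
        "start": start_base,
        "end": start_base + orf_length - 1,
        "codons": orf_length // 3,
    }


pn4 = orf_summary(22, 5934)    # values quoted for SEQ ID NO:1
pn4a = orf_summary(22, 5964)   # values quoted for SEQ ID NO:2

# Difference between the two reading frames, in bases and in codons.
extra_bases = 5964 - 5934
extra_codons = extra_bases // 3

if __name__ == "__main__":
    print("PN4: ", pn4)
    print("PN4a:", pn4a)
    print(f"PN4a is longer by {extra_bases} bases ({extra_codons} codons)")
```

The 30-base difference corresponds to 10 codons, which agrees with the 30 base pair insert and the 10 amino acid insertion described elsewhere in this specification.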

Reverse transcription-polymerase chain reaction (degenerate oligonucleotide-primed
"RT-PCR") analysis of RNA from the rat central and peripheral nervous systems, in
particular from rat dorsal root ganglia ("DRG"), was performed. Eight main tissue types
were screened by RT-PCR for expression of the unique PN4 genes corresponding to
positions 4646-5203 of SEQ ID NO:1. PN4 was present in five of the tissues studied:
brain, spinal cord, DRG, nodose ganglia and superior cervical ganglia. PN4 was not present
in the remaining tissues studied: sciatic nerve tissue, heart or skeletal muscle tissue.

Three main tissue types were screened by RT-PCR for expression of the unique 
PN4a genes corresponding to positions 1947-2135 of SEQ ID NO:2. PN4a was present in 
two of the tissues studied: spinal cord and DRG. PN4a was not present in brain tissue. 
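
The RT-PCR results above can be summarized as sets, and the lengths of the screened regions follow from the quoted positions. This is a hedged sketch: the variable and function names are hypothetical, and treating the quoted positions as 1-based, inclusive endpoints of the screened region is an assumption.

```python
# Illustrative sketch only: tissue lists and positions are taken from the text.
PN4_POSITIVE = {"brain", "spinal cord", "DRG", "nodose ganglia", "superior cervical ganglia"}
PN4_NEGATIVE = {"sciatic nerve", "heart", "skeletal muscle"}
PN4A_POSITIVE = {"spinal cord", "DRG"}
PN4A_NEGATIVE = {"brain"}


def region_length(first: int, last: int) -> int:
    """Length of a region given inclusive first and last positions."""
    return last - first + 1


pn4_region = region_length(4646, 5203)    # positions in SEQ ID NO:1
pn4a_region = region_length(1947, 2135)   # positions in SEQ ID NO:2

# Tissues in which both PN4 and PN4a were detected.
both_detected = sorted(PN4_POSITIVE & PN4A_POSITIVE)

if __name__ == "__main__":
    print("PN4 region length: ", pn4_region)
    print("PN4a region length:", pn4a_region)
    print("Detected for both: ", both_detected)
```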

The invention also pertains to the cloning and functional expression in Xenopus
oocytes of the novel PN4 and PN4a rat TTX-sensitive sodium channels. Specifically, the
α-subunit of the sodium channels was cloned and expressed. Functional expression shows
that PN4 and PN4a are voltage-gated, TTX-sensitive sodium channels with properties that
are similar to other TTX-sensitive sodium channels.

Preferred aspects of this invention are PN4 cDNA sequences which encode the
novel mammalian TTX-sensitive sodium channel proteins that are expressed in brain, spinal
cord, dorsal root ganglia, nodose ganglia and superior cervical ganglia but not in sciatic
nerve, heart or skeletal muscle when assayed by the methods described herein, such as
RT-PCR.

Also preferred aspects of this invention are PN4a cDNA sequences which encode
the novel mammalian TTX-sensitive sodium channel proteins that are expressed most
strongly in DRG, with little expression in spinal cord and almost undetectable expression in
brain when assayed by the methods described herein, such as RT-PCR.

cDNA sequences which encode the novel PN4 TTX-sensitive sodium channel
proteins that are predominantly expressed in the brain and spinal cord are also
contemplated by this invention. cDNA sequences which encode the novel PN4a
TTX-sensitive sodium channel proteins that are predominantly expressed in the DRG are also
contemplated by this invention.

The term "cDNA", or complementary DNA, refers to single-stranded or
double-stranded DNA sequences obtained by reverse transcription of mRNA isolated from a donor
cell. For example, treatment of mRNA with a reverse transcriptase such as AMV reverse
transcriptase or M-MuLV reverse transcriptase in the presence of an oligonucleotide
primer will furnish an RNA-DNA duplex which can be treated with RNase H, DNA
polymerase, and DNA ligase to generate double-stranded cDNA. If desired, the
double-stranded cDNA can be denatured by conventional techniques such as heating to generate
single-stranded cDNA. The term "cDNA" includes cDNA that is a complementary copy
of the naturally occurring mRNA as well as complementary copies of variants of the
naturally occurring mRNA that have the same biological activity. Variants would include,
for example, insertions, deletions, sequences with degenerate codons and alleles. For
example, PN4a is a splice variant of PN4, having a 10 amino acid insertion.

The term "cRNA" refers to RNA that is a copy of the mRNA transcribed by a cell.
cRNA corresponding to mRNA transcribed from a DNA sequence encoding the α-subunit
of a novel TTX-sensitive sodium channel protein is contemplated by this invention.

The present invention also includes expression vectors comprising the DNA or the
cDNA described above, host cells transformed with these expression vectors capable of
producing the sodium channel of the invention, and cDNA libraries comprising such host
cells.

The term "expression vector" refers to any genetic element, e.g., a plasmid, a 
chromosome, a virus, behaving either as an autonomous unit of polynucleotide expression 
within a cell or being rendered capable of replication by insertion into a host cell 
chromosome, having attached to it another polynucleotide segment, so as to bring about the 
replication and/or expression of the attached segment. Suitable vectors include, but are not 
limited to, plasmids, bacteriophages and cosmids. Vectors will contain polynucleotide 
sequences which are necessary to effect ligation or insertion of the vector into a desired 
host cell and to effect the expression of the attached segment. Such sequences differ 
depending on the host organism, and will include promoter sequences to effect 
transcription, enhancer sequences to increase transcription, ribosomal binding site 
sequences and transcription and translation termination sequences. 

The term "host cell" generally refers to prokaryotic or eukaryotic organisms and
includes any transformable or transfectable organism which is capable of expressing a 
protein and can be, or has been, used as a recipient for expression vectors or other 
transferred DNA. Host cells can also be made to express protein by direct injection with 
exogenous cRNA translatable into the protein of interest. A preferred host cell is the 
Xenopus oocyte. 

The term "transformed" refers to any known method for the insertion of foreign 
DNA or RNA sequences into a host prokaryotic cell. The term "transfected" refers to any
known method for the insertion of foreign DNA or RNA sequences into a host eukaryotic 
cell. Such transformed or transfected cells include stably transformed or transfected cells in 
which the inserted DNA is rendered capable of replication in the host cell. They also 
include transiently expressing cells which express the inserted DNA or RNA for limited 
periods of time. The transformation or transfection procedure depends on the host cell 
being transformed. It can include packaging the polynucleotide in a virus as well as direct 
uptake of the polynucleotide, such as, for example, lipofection or microinjection. 
Transformation and transfection can result in incorporation of the inserted DNA into the 
genome of the host cell or the maintenance of the inserted DNA within the host cell in 
plasmid form. Methods of transformation are well known in the art and include, but are 
not limited to, viral infection, electroporation, lipofection and calcium phosphate mediated
direct uptake. 

It is to be understood that this invention is intended to include other forms of
expression vectors, host cells and transformation techniques which serve equivalent
functions and which become known to the art hereto.

The term "cDNA library" refers to a collection of clones, usually in a bacteriophage,
or less commonly in bacterial plasmids, containing cDNA copies of mRNA sequences
derived from a donor cell or tissue.

In addition, the present invention contemplates recombinant polynucleotides, of 
about 15 to 20kb, preferably 10 to 15kb nucleotides in length, comprising a nucleic acid 
sequence derived from the DNA of the invention. 

The term "polynucleotide" as used herein refers to a polymeric form of nucleotides
of any length, either ribonucleotides or deoxyribonucleotides. This term refers only to the
primary structure of the molecule. Thus, this term includes double- and single-stranded
DNA, as well as double- and single-stranded RNA. It also includes modified, for example,
by methylation and/or by capping, and unmodified forms of the polynucleotide.

The term "derived from" a designated sequence refers to a nucleic acid sequence
that is comprised of a sequence of approximately at least 6-8 nucleotides, more preferably
at least 10-12 nucleotides, and, even more preferably, at least 15-20 nucleotides that
correspond to, i.e., are homologous or complementary to, a region of the designated
sequence. The derived sequence is not necessarily physically derived from the nucleotide
sequence shown, but may be derived in any manner, including for example, chemical
synthesis or DNA replication or reverse transcription, which are based on the information
provided by the sequences of bases in the region(s) from which the polynucleotide is
derived.

Further, the term "polynucleotide" is intended to include a recombinant
polynucleotide, which is of genomic, cDNA, semisynthetic or synthetic origin which, by
virtue of its origin or manipulation, is not associated with all or a portion of the
polynucleotide with which it is associated in nature and/or is linked to a polynucleotide
other than that to which it is linked in nature.

The "native" version of PN4, and its splice variant PN4a, partially correspond to
the sodium channel NaCh6 (described by Schaller et al. in J. Neurosci. 15, 3231-3242
(1995)) as shown in Figs. 3 and 4. In Figs. 3 and 4, base pair and amino acid sequences of
the native PN4 and PN4a sodium channel cDNA clones include untranslated sequences.
The published NaCh6 sequence does not appear to provide the sodium channel sequence
correctly, and the sequences for PN4 and PN4a appear to represent the authentic sodium
channel sequence for the following reasons:

First, most sodium channel gene coding regions, including PN4, begin with an eleven
base pair sequence consisting of an out of frame ATG, followed by five base pairs
downstream, followed by the ATG initiation codon for the coding region. The DNA
sequence alignment (Fig. 3) shows a two base pair deletion in NaCh6 overlapping the
second ATG, so that the normally out of frame, upstream ATG becomes the NaCh6
initiation codon, leading to a two amino acid insertion. Start and stop codons are underlined
and primers are denoted by dashed lines with arrows.

Examination of the DNA sequence alignment (Fig. 3) shows that the bulk of the
differences (residues in bold print) between the two sequences that would strongly
influence protein function consist of a series of nine single base deletions in the Interdomain
I/II region. These differences lead to a very different amino acid sequence, as shown in the
amino acid alignment of Fig. 4, where the differences between the two sequences are again
shown in bold print. The applicants' sequencing of multiple isolates resulting from the
cloning of up to 1.5 kb of the Interdomain I/II region by PCR repeatedly resulted in
sequences which completely agreed with PN4 or PN4a sequences.

Comparison of PN4 and PN4a sequences to other sodium channel sequences shows
a high degree of homology. For example, Fig. 5 is a comparison of PN4 and PN4a with
NaCh6 and rat Brain type II in this region. Whereas PN4 and Brain II share about 50%
identity in the region highlighted in bold, NaCh6 is almost completely different. The
differences between Brain II and PN4 are underlined.



PCR was also employed to look specifically for NaCh6. A sense primer common
to both sequences (CAATCGTGGGCGCCCTAATC, corresponding to base pair 722-742
of NaCh6 and shown by dashed lines with an arrow in Fig. 3 at bases 884-904 of PN4) was
paired with gene specific antisense primers (TGCTTTCATGCACTGGAATCCCTCT,
corresponding to base pair 1194-1170 of PN4, and
TGCTTTACTGCACTGGAATCCTTCG, corresponding to base pair 1029-1005 of
NaCh6; sequence differences between the two primers are underlined). The antisense
primers prime at a three base pair deletion of NaCh6 relative to PN4 and overlap three
other sequence differences, as shown in Fig. 3. A PCR product of the expected size (about
300 base pairs) was obtained with the PN4 specific antisense primer using
pBK-CMV/75-1.4 DNA (described in the description of SEQ ID NO:2) and with rat Brain and rat DRG
cDNA templates. No PCR products were obtained from these templates with the NaCh6
specific primer.
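
The two gene specific antisense primers and the expected product sizes can be compared with a short sketch. It is illustrative only: the function names are hypothetical, the comparison is ungapped and position by position (it does not model the three base pair deletion as an alignment gap), and the product size assumes 1-based inclusive coordinates running from the first base of the sense primer to the outermost base of the antisense primer.

```python
from typing import List

# Primer sequences and coordinates are taken from the text above.
PN4_ANTISENSE = "TGCTTTCATGCACTGGAATCCCTCT"    # PN4, bases 1194-1170
NACH6_ANTISENSE = "TGCTTTACTGCACTGGAATCCTTCG"  # NaCh6, bases 1029-1005


def mismatch_positions(a: str, b: str) -> List[int]:
    """0-based positions at which two equal-length primers differ (no gaps)."""
    if len(a) != len(b):
        raise ValueError("primers must be the same length for an ungapped comparison")
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


def product_size(sense_start: int, antisense_outer_base: int) -> int:
    """Amplicon length from the sense primer start to the antisense primer's outer base."""
    return antisense_outer_base - sense_start + 1


diffs = mismatch_positions(PN4_ANTISENSE, NACH6_ANTISENSE)
pn4_product = product_size(884, 1194)     # PN4 numbering
nach6_product = product_size(722, 1029)   # NaCh6 numbering

if __name__ == "__main__":
    print("Primer length:", len(PN4_ANTISENSE))
    print("Mismatch positions (0-based):", diffs)
    print("Expected PN4 product:", pn4_product, "bp")
    print("Expected NaCh6 product:", nach6_product, "bp")
```

Both computed sizes are close to the "about 300 base pairs" stated for the expected product.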

Any of the sequence differences between PN4 and NaCh6 could result in an
inability of the NaCh6 gene to form a functional channel. However, some differences could
be ascribed to "base calling." To verify the accuracy of the sequence, the full length versions
of PN4 and PN4a were sequenced. Of the amino acid differences between PN4 and NaCh6,
it appears that the profound differences in the Interdomain I/II region are responsible for
the lack of success in expression of the NaCh6 gene. The nine single base deletions in this
region appear to shift the reading frame (see Figs. 3 and 4), leading to a "nonsense" peptide
which lacks a number of highly conserved residues (Fig. 5) and which could sufficiently
disrupt the structure of the protein to destroy its function.

The splice variant PN4a is similar to and occurs in a homologous position to that
seen with rat Brain type I and Ia channels (Schaller et al., J. Neurosci. 12, 1370-1381
(1992)). In each case it appears that the variants make use of the same 3' splice acceptor
sites but alternative 5' sites. Rat Brain III also has splice variants in this region, using the
same 3' splice site but using alternative 5' sites more 5' than the other channels. An amino
acid comparison with other rat (r) and human (h) channels is shown below. Not all sodium
channels have this splicing pattern.

It is interesting to note that rat PN1 is similar to PN4a whereas its human
homologue, the neuroendocrine channel, hNE-Na, is similar to PN4. Perhaps each of these
will be found to be one of a set of splice variants. Whereas the splicing patterns of Brain I,
II, and III were found not to vary across a range of tissues (Schaller et al., J. Neurosci. 12,
1370-1381 (1992)), PN4 and PN4a show dramatic abundance differences. PN4 has a
gradient of expression with high expression in brain, intermediate in spinal cord and
relatively the least in DRG. PN4a is very low or undetectable in brain, a minor fraction of
total PN4 expression in spinal cord, and nearly as abundant as PN4 in DRG.

Many uses of the invention exist, a few of which are described below. 



1. Probe for human channel.

As mentioned above, it is believed that homologs of the novel rat TTX-sensitive
sodium channel described herein are also expressed in mammalian nerve tissue, in particular,
human tissue. The entire cDNAs of PN4 and PN4a rat sodium channels of the present
invention can be used as a probe to discover whether novel PN4 and PN4a voltage-gated,
TTX-sensitive sodium channels exist in human nerve tissue and, if they do, to aid in
isolating the cDNAs for the human protein.

The human homologues of the rat TTX-sensitive PN4 and PN4a channels can be
cloned using a human DRG cDNA library. Human DRG are obtained at autopsy. The
frozen tissue is homogenized and the RNA extracted with guanidine isothiocyanate
(Chirgwin et al., Biochemistry 18, 5294-5299 (1979)). The RNA is size-fractionated on a
sucrose gradient to enrich for large mRNAs because the sodium channel α-subunits are
encoded by large (7-11 kb) transcripts. Double-stranded cDNA is prepared using the
Superscript Choice cDNA kit (GIBCO BRL) with either oligo(dT) or random hexamer
primers. EcoRI adapters are ligated onto the double-stranded cDNA which is then
phosphorylated. The cDNA library is constructed by ligating the double-stranded cDNA
into the bacteriophage-lambda ZAP II vector (Stratagene) followed by packaging into phage
particles.

Phage are plated out on 150 mm plates on a lawn of XL1-Blue MRF' bacteria
(Stratagene) and plaque replicas are made on Hybond N nylon membranes (Amersham).
Filters are hybridized to rat PN4 and PN4a cDNA probes by standard procedures and
detected by autoradiography or chemiluminescence. The signal produced by the rat PN4
and PN4a probes hybridizing to positive human clones at high stringency should be
stronger than that obtained with rat brain sodium channel probes hybridizing to these clones.
Positive plaques are further purified by limiting dilution and re-screened by hybridization
or PCR. Restriction mapping and polymerase chain reaction will identify overlapping
clones that can be assembled by standard techniques into the full-length human homologue
of rat PN4 and PN4a. The human clone can be expressed by injecting cRNA transcribed in
vitro from the full-length cDNA clone into Xenopus oocytes, or by transfecting a
mammalian cell line with a vector containing the cDNA linked to a suitable promoter.

2. Probe for Obtaining Molecular Data.

The polynucleotides of the invention can be bound to a reporter molecule to form a 
polynucleotide probe useful for Northern and Southern blot analysis and in situ 
hybridization. 



The term "reporter molecule" refers to a chemical entity capable of being detected
by a suitable detection means, including, but not limited to, spectrophotometric,
chemiluminescent, immunochemical, or radiochemical means. The polynucleotides of this
invention can be conjugated to a reporter molecule by techniques well known in the art.
Typically the reporter molecule contains a functional group suitable for attachment to or
incorporation into the polynucleotide. The functional groups suitable for attaching the
reporter group are usually activated esters or alkylating agents. Details of techniques for
attaching reporter groups are well known in the art. See, for example, Matthews et al.,
Anal. Biochem., 151, 205-209 (1985) and Engelhardt et al., European Patent Application
No. 0 302 175.


3. Antibodies Against PN4 and PN4a. 

The polypeptides of the invention are highly useful for the development of
antibodies against PN4 and PN4a. Such antibodies can be used in affinity chromatography
to purify recombinant sodium channel proteins or polypeptides, or they can be used as a
research tool. For example, antibodies bound to a reporter molecule can be used in
histochemical staining techniques to identify other tissues and cell types where PN4 and
PN4a are present, or they can be used to identify epitopic or functional regions of the
sodium channel protein of the invention.

The antibodies can be monoclonal or polyclonal and can be prepared by techniques
that are well known in the art. Polyclonal antibodies are prepared as follows: an
immunogenic conjugate comprising PN4, PN4a or a fragment thereof, optionally linked to a
carrier protein, is used to immunize a selected mammal such as a mouse, rabbit, goat, etc.
Serum from the immunized mammal is collected and treated according to known procedures
to separate the immunoglobulin fraction.

Monoclonal antibodies are prepared by standard hybridoma cell technology based
on that reported by Kohler and Milstein in Nature 256, 495-497 (1975): spleen cells are
obtained from a host animal immunized with the PN4 or PN4a protein or a fragment
thereof, optionally linked to a carrier. Hybrid cells are formed by fusing these spleen cells
with an appropriate myeloma cell line and cultured. The antibodies produced by the
hybrid cells are screened for their ability to bind to expressed PN4 or PN4a proteins.

A number of screening techniques well known in the art, such as, for example,
forward or reverse enzyme-linked immunosorbent assay screening methods, may be
employed. The hybrid cells producing such antibodies are then subjected to recloning and
high dilution conditions in order to select a hybrid cell that secretes a homogeneous
population of antibodies specific to either the PN4 or PN4a protein.

20 

In addition, antibodies can be raised by cloning and expressing nucleotide sequences 
or mutagenized versions thereof coding at least for the amino acid sequences required for 
specific binding of natural antibodies, and these expressed proteins used as the immunogen. 
Antibodies may include the complete immunoglobulin or a fragment thereof. Antibodies 
may be linked to a reporter group such as is described above with reference to
polynucleotides. 

4. Therapeutic Targets for Disorders. 

The present invention also includes the use of the novel voltage-gated, TTX-sensitive
sodium channel α-subunit as a therapeutic target for compounds to treat disorders
of the nervous system including, but not limited to, epilepsy, stroke injury, brain injury,
allodynia, hyperalgesia, diabetic neuropathy, traumatic injury and AIDS-associated
neuropathy. The invention allows for the manipulation of genetic materials by recombinant
technology to produce polypeptides that possess the structural and functional
characteristics of the novel voltage-gated, TTX-sensitive sodium channel α-subunit found
in nerve tissue, particularly in sensory nerves. Site-directed mutagenesis can be used to
provide such recombinant polypeptides. For example, synthetic oligonucleotides can be
specifically inserted or substituted into the portion of the gene of interest to produce genes
encoding and expressing a specific mutant. Random degenerate oligonucleotides can also
be inserted and phage display techniques can be used to identify and isolate polypeptides
possessing a functional property of interest.

5. Designing Therapeutics based on Inhibiting PN4 and PN4a and assays thereof. 

The present invention is also directed to inhibiting the activity of PN4 in brain, 
spinal cord, DRG, nodose ganglia and superior cervical ganglia tissues. This invention is
also directed to inhibiting the activity of PN4a in spinal cord and DRG tissues. However, 
it is to be understood that further studies may reveal that PN4 and PN4a are present in 
other tissues, and as such, those tissues can also be targeted areas. For example, the 
detection of PN4 mRNA in nodose ganglia suggests that PN4 may conduct TTX-sensitive 
sodium currents in this and other sensory ganglia of the nervous system. 



In addition, it has been found that proteins not normally expressed in certain
tissues are expressed in a disease state. Therefore, this invention is intended to encompass
the inhibition of PN4 and PN4a in tissues and cell types where the protein is normally
expressed, and in those tissues and cell types where the protein is only expressed during a
disease state.

The invention also pertains to an assay for inhibitors of the novel TTX-sensitive
sodium channel protein comprising contacting a compound suspected of being an inhibitor
with expressed sodium channel and measuring the activity of the sodium channel. The
compound can be a substantially pure compound of synthetic origin combined in an
aqueous medium, or the compound can be a naturally occurring material such that the assay
medium is an extract of biological origin, such as, for example, a plant, animal, or microbial
cell extract. PN4 and PN4a activity can be measured by methods such as
electrophysiology (two electrode voltage clamp or single electrode whole cell patch clamp),
guanidinium ion flux assays and toxin-binding assays. An "inhibitor" is generally defined as
that amount that results in a greater than 50% decrease in PN4 or PN4a activity, preferably
a greater than 70% decrease in PN4 or PN4a activity, and more preferably a greater than 90%
decrease in PN4 or PN4a activity.
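
The thresholds in the preceding definition can be illustrated with a short sketch. It assumes that activity is reported as a single positive number per condition (for example a peak current or a flux value); the function names and the tier labels are illustrative only and are not part of the specification.

```python
def percent_decrease(control_activity: float, treated_activity: float) -> float:
    """Percent decrease in channel activity relative to the untreated control."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (control_activity - treated_activity) / control_activity


def inhibition_tier(decrease_pct: float) -> str:
    """Map a percent decrease onto the >50%, >70% and >90% thresholds in the text."""
    if decrease_pct > 90:
        return "more preferred (>90%)"
    if decrease_pct > 70:
        return "preferred (>70%)"
    if decrease_pct > 50:
        return "inhibitor (>50%)"
    return "not an inhibitor by this definition"
```

For instance, a hypothetical control activity of 2.0 and treated activity of 0.5 (arbitrary units) corresponds to a 75% decrease, which falls in the "preferred" tier.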

6. Designing and Screening for Additional Therapeutics.

Another significant characteristic of PN4 is that it is TTX-sensitive. It is believed
that TTX-sensitive sodium channels play a key role in transmitting nerve impulses relating 
to sensory inputs such as pain and pressure. This will also facilitate the design of 
therapeutics that can be targeted to a specific area such as nerve tissue. 

Additionally, the recombinant protein of the present invention can be used to screen 
for potential therapeutics that have the ability to inhibit the sodium channel of interest. In 
particular, it would be useful to inhibit selectively the function of sodium channels in nerve 
tissues responsible for transmitting pain and pressure signals without simultaneously 
affecting the function of sodium channels in other tissues such as heart and muscle. Such
selectivity would allow for the treatment of pain without causing side effects due to cardiac 
or neuromuscular complications. Therefore, it would be useful to have DNA sequences 
coding for sodium channels that are selectively expressed in nerve tissue. 

7. Pain Reliever.

Sodium channels in nerve tissue play a large role in the transmission of nerve
impulses, and therefore are instrumental in understanding neuropathic pain transmission.
Neuropathic pain falls into two categories: allodynia, where a normally non-painful
stimulus becomes painful, and hyperalgesia, where a normally painful stimulus
becomes extremely painful. The ability to inhibit the activity of these sodium channels,
i.e., reduce the conduction of nerve impulses, will affect the nerve's ability to transmit pain.
Selective inhibition of sodium channels in sensory neurons such as dorsal root ganglia will
allow the blockage of pain impulses without complicating side effects caused by inhibition
of sodium channels in other tissues such as brain and heart. In addition, certain diseases are
caused by sodium channels that produce impulses at an extremely high frequency. The
ability to reduce the activity of the channel can then eliminate or alleviate the disease.
Accordingly, potential therapeutic compounds can be screened by methods well known in
the art, to discover whether they can inhibit the activity of the recombinant sodium channel
of the invention. See Barram et al., Naunyn-Schmiedeberg's Archives of Pharmacology 347,
125-132 (1993) and McNeal et al., J. Med. Chem. 28, 381-388 (1985). For similar studies
with the acetylcholine receptor, see Claudio et al., Science 238, 1688-1694 (1987).

Accordingly, the present invention encompasses a method of alleviating pain by
inhibiting the activity of the novel TTX-sensitive sodium channel comprising administering
a therapeutically effective amount of a compound having an IC50 in the range of 0.1-50 nM,
preferably within the range of 1-25 nM, and most preferably within the range of 1-5 nM.
Potential therapeutic compounds are identified based on their ability to inhibit the activity
of PN4 and PN4a. Therefore, the aforementioned assay can be used to identify compounds
having a therapeutically effective IC50.

The term "IC50" refers to the concentration of a compound that is required to inhibit
by 50% the activity of expressed PN4 or PN4a when activity is measured by
electrophysiology, flux assays or toxin-binding assays, as mentioned above.
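
As an illustration of how an IC50 might be read off assay data and compared with the ranges stated above, the following sketch uses log-linear interpolation between the two tested concentrations that bracket 50% inhibition. This is only one simple estimation approach chosen for the example, not a method described in the specification; the function names are hypothetical and any data passed in would be the user's own.

```python
import math


def ic50_by_interpolation(concs_nM, inhibition_pct):
    """Estimate IC50 (nM) by log-linear interpolation between the two tested
    concentrations whose percent inhibition values bracket 50%."""
    points = sorted(zip(concs_nM, inhibition_pct))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi > i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None  # 50% inhibition is not bracketed by the tested concentrations


def ic50_range(ic50_nM):
    """Place an IC50 value into the ranges quoted in the text."""
    if 1 <= ic50_nM <= 5:
        return "most preferred (1-5 nM)"
    if 1 <= ic50_nM <= 25:
        return "preferred (1-25 nM)"
    if 0.1 <= ic50_nM <= 50:
        return "within 0.1-50 nM"
    return "outside 0.1-50 nM"
```

With hypothetical readings of 25% inhibition at 1 nM and 75% at 10 nM, the interpolated IC50 is the geometric mean of the two concentrations, about 3.2 nM.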

The basic molecular biology techniques employed in accomplishing features of this 
invention, such as RNA, DNA and plasmid isolation, restriction enzyme digestion, 
preparation and probing of a cDNA library, sequencing clones, constructing expression 
vectors, transforming cells, maintaining and growing cell cultures and other general 
techniques are well known in the art, and descriptions of such techniques can be found in 
general laboratory manuals such as Molecular Cloning: A Laboratory Manual by Sambrook, 
et al. (Cold Spring Harbor Laboratory Press, 2nd edition, 1989). Accordingly, the 
following examples are merely illustrative of the techniques by which the invention can be 
practiced. 

BRIEF DESCRIPTION OF THE SEQ ID'S AND FIGURES

SEQ ID NO:1 depicts an engineered version of the nucleotide cDNA sequence
encoding the rat TTX-sensitive peripheral nerve sodium channel type 4 ("PN4"). This
version lacks most of the untranslated sequences, thereby comprising a 5934-base open
reading frame, beginning at nucleotide residue 22 of the XhoI-HindIII clone, the start site of
translation, and ending at residue 5956.

SEQ ID NO:2 depicts an engineered version of the nucleotide cDNA sequence
encoding the rat TTX-sensitive peripheral nerve sodium channel type 4a ("PN4a"). This
version lacks most of the untranslated sequences, thereby comprising a 5964-base open
reading frame, beginning at nucleotide residue 22 of the XhoI-HindIII clone, the start site of
translation, and ending at residue 5986. The 30 base pair insert is found at positions
2014-2043.
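
The figures quoted for the two open reading frames can be cross-checked with simple arithmetic. The sketch below only restates numbers given in the text; the variable names are illustrative, and treating positions 2014-2043 as an inclusive range is an assumption.

```python
PN4_ORF_BASES = 5934    # open reading frame length quoted for SEQ ID NO:1
PN4A_ORF_BASES = 5964   # open reading frame length quoted for SEQ ID NO:2
INSERT_FIRST, INSERT_LAST = 2014, 2043  # insert positions, assumed inclusive

insert_length = INSERT_LAST - INSERT_FIRST + 1
orf_difference = PN4A_ORF_BASES - PN4_ORF_BASES

# An insert whose length is a multiple of three keeps the reading frame intact.
insert_is_in_frame = insert_length % 3 == 0
extra_codons = insert_length // 3
```

Under these assumptions the insert length and the difference between the two reading frames agree at 30 bases, corresponding to 10 additional codons in PN4a.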

Fig. 1 (SEQ ID NO:3) depicts the deduced amino acid sequence of PN4, represented
in the single-letter amino acid code. Shown in Fig. 1 are the homologous domains (I-IV); the
putative transmembrane segments (S1-S6); the amino acid conferring sensitivity to TTX
(A); potential cAMP-phosphorylation site (•); and potential N-glycosylation site (♦).

Fig. 2 (SEQ ID NO:4) depicts the deduced amino acid sequence of PN4a,
represented in the single-letter amino acid code. Shown in Fig. 2 are the homologous
domains (I-IV); the putative transmembrane segments (S1-S6); the amino acid conferring
sensitivity to TTX (A); potential cAMP-phosphorylation site (•); and potential
N-glycosylation site (♦).

Fig. 3 aligns the base pair sequences of the NaCh6 and the "native" version of the
PN4 sodium channel cDNA clones (SEQ ID NO:7), including untranslated sequences,
depicting the differences in bold. Start and stop codons are underlined and primers are
denoted by dashed lines with arrows.

Fig. 4 aligns the amino acid sequences of the PN4a, PN4 and NaCh6 sodium channel
cDNA clones of Fig. 3, depicting the differences in bold.

Fig. 5 is a comparison of the conserved region Interdomain I/II between PN4a, PN4,
NaCh6 and BrainII sodium channels. Differences between PN4 (and PN4a) and NaCh6 are
shown in bold type and differences between BrainII and PN4 are underlined.

SEQ ID NO:5 depicts the 696 nucleotide cDNA sequence encoding the novel probe 
CNaD4-2 used to identify the novel sodium channels of the invention. 

SEQ ID NO:6 depicts the deduced amino acid sequence of probe CNaD4-2,
represented in the single-letter amino acid code.

Fig. 6 depicts the cloning map of PN4 and PN4a. 

Fig. 7 shows the properties of currents produced in Xenopus oocytes by injection
of PN4 cRNA. Fig. 7a shows the current produced by sodium channels expressed in an
oocyte; Fig. 7b shows the current-voltage relationship.

Fig. 8a and 8b show steady state inactivation of sodium currents produced by PN4
in Xenopus oocytes.

Fig. 9 demonstrates the effects of the β1 and β2 subunits upon PN4 function in Xenopus
oocytes. Fig. 9a shows currents produced when the PN4 α-subunit is injected alone; Fig. 9b
is with the β1 subunit; Fig. 9c is with the β2 subunit; and Fig. 9d is with both the β1 and β2
subunits.

Abbreviations 

BSA                   bovine serum albumin
Denhardt's solution   0.02% BSA, 0.02% polyvinylpyrrolidone, 0.02% Ficoll
                      (0.1 g BSA, 0.1 g Ficoll and 0.1 g polyvinylpyrrolidone per
                      500 ml)
DRG                   dorsal root ganglia
EDTA                  Ethylenediaminetetraacetic acid, tetrasodium salt
MEN                   20 mM MOPS, 1 mM EDTA, 5 mM sodium acetate, pH 7.0
MOPS                  3-(N-morpholino)propanesulfonic acid (Sigma Chemical Company)
PN3                   peripheral nerve sodium channel type 3
PNS                   peripheral nervous system
SDS                   sodium dodecyl sulfate
SSC                   150 mM NaCl, 15 mM sodium citrate, pH 7.0
SSPE                  80 mM NaCl, 10 mM sodium phosphate, 1 mM
                      ethylenediaminetetraacetate, pH 8.0
TEV                   two electrode voltage clamp
TTX                   tetrodotoxin (Sigma Chemical Company)
UTR                   untranslated region



EXAMPLES 

Each step employed in obtaining the DNA of the novel sodium channel of the 
invention is described in the detailed examples below. The following is an overview of the 
steps. Example 1 describes how a novel probe, CNaD4-2, was obtained by designing 
primers based on known sodium channels. Example 2 describes the construction and 
screening of a cDNA library with CNaD4-2 to obtain the 3' end of the novel sodium 
channel of the invention. Then, a known primer was employed to obtain the 5' end of the 
DNA of the invention. Example 3 describes how RT-PCR was employed to span the gap
between the 3' and 5' ends obtained from the cDNA library. This resulted in a 798 base
pair sequence and a splice variant thereof, having an 828 base pair sequence. Example 4
describes assembling the clones into two full-length cDNA clones in expression vectors.
The cloning map is illustrated in Fig. 6. Example 5 discusses the tissue distribution and
localization accomplished by RT-PCR. Example 6 discusses the northern analysis of
mRNA. Example 7 discloses obtaining expression data from Xenopus oocytes, and
localization by RT-PCR.

Materials 

The plasmid pBK-CMV was obtained from Stratagene (La Jolla, CA); plasmid
Litmus 29 was obtained from New England Biolabs (Beverly, MA); the oocyte expression
vector plasmid pBSTAcIIr was constructed from pBSTA (described by Goldin et al., in
Methods in Enzymology (Rudy & Iverson, eds.) 207, 279-297) by insertion of a synthetic
oligonucleotide linker; the mammalian cell expression vector plasmid pCI-neo was obtained
from Promega (Madison, WI); plasmid pCRII was obtained from Invitrogen (San Diego,
CA). Competent E. coli cell lines STBL2™ and SURE® were obtained from GIBCO/BRL
and Stratagene, respectively.

EXAMPLE 1: Identification of a novel channel fragment

A novel probe used to identify the novel sodium channels was obtained as follows.
Degenerate oligonucleotide primers were designed based on the homologies between known
sodium channels in domain IV and used to perform RT-PCR on RNA isolated from rat
DRG. The domain IV PCR products were cloned into pCRII, transformed into E. coli and
single colonies isolated. DNA sequence of the inserts of several of these colonies was
obtained, including the following novel sequence from clone pCRII/CNaD4-2 of SEQ ID
NO:5, identified as CNaD4-2. SEQ ID NO:6 depicts the deduced amino acid sequence of
probe CNaD4-2, represented in the single-letter amino acid code.

CNaD4-2 can be made with standard PCR techniques. 

EXAMPLE 2: Construction and screening of cDNA library from rat DRG with probe 
CNaD4-2 

EcoRI-adapted cDNA was prepared from normal adult male Sprague-Dawley rat
DRG poly(A)+ RNA using the Superscript Choice System (GIBCO BRL). cDNA (>4kb)
was selected by sucrose gradient fractionation as described by Kieffer (Gene 109, 115-119
(1991)). The cDNA was then ligated into the Zap Express vector (Stratagene), and
packaged with the Gigapack II XL lambda packaging extract (Stratagene). Plate lysates
were prepared and screened by PCR using CNaD4-2 specific primers
(ACACTCAGAGCAAGCAGATGG and TCCCTGGGTGCTCTTTGTCCA,
corresponding to bases 32 to 52 and 569 to 589 of SEQ ID NO:5, respectively). Phage
from one positive lysate were screened by filter hybridization with a 32P-labeled probe (the
700 base pair EcoRI insert from CNaD4-2). Filters were hybridized in 50% formamide, 5X
SSPE, 5X Denhardt's solution, 0.5% SDS, 250 µg/ml sheared, denatured salmon sperm
DNA, and 50 mM sodium phosphate at 42°C and washed in 0.5X SSC, 0.1% SDS at 50°C.
Positive clones were excised in vivo into pBK-CMV using the ExAssist/XLOLR system
(Stratagene).
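
The primer coordinates given above determine the size of the PCR product expected from the CNaD4-2 specific primers. The following sketch recomputes it, assuming the quoted base ranges are inclusive positions on SEQ ID NO:5; the helper names are illustrative.

```python
FORWARD = "ACACTCAGAGCAAGCAGATGG"  # stated to correspond to bases 32-52
REVERSE = "TCCCTGGGTGCTCTTTGTCCA"  # stated to correspond to bases 569-589


def span_length(first: int, last: int) -> int:
    """Number of bases in an inclusive coordinate range."""
    return last - first + 1


def amplicon_length(forward_span, reverse_span) -> int:
    """Length of the product running from the first base of the forward
    primer site to the last base of the reverse primer site."""
    return reverse_span[1] - forward_span[0] + 1


expected = amplicon_length((32, 52), (569, 589))
```

Both primers are 21 bases long, matching the 21-base coordinate ranges, and the expected product is 558 base pairs, in agreement with the amplicon size quoted in Example 5.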

Approximately 95% of these clones contained sodium channel sequence under
standard screening stringency conditions. The number of clones that are retrieved that
contain sodium channel sequence can be increased with increased stringency conditions and
careful analysis and interpretation of data. It is well known in the art that, when screening for a
particular type of DNA sequence, other types of DNA sequences will also be hybridized,
depending on the specificity of the probe. Here, with the carefully designed probe of the
invention, the approximately 95% "hit" rate makes this fragment an exceptionally good
sodium channel probe.

One of these clones, pBK-CMV/PN4.10-1, contained sequence of the CNaD4-2
channel from domain II through the 3' UTR. The position of the pBK-CMV/PN4.10-1
fragment in the PN4 and PN4a cloning map is shown in Fig. 6. In Fig. 6, ATG is the start
codon, TAC is the stop codon and V is the position of the PN4a splice insert.

A degenerate primer designed for sodium channels in domain I
(ACCAACTG[T/C]GT[G/A]TT[T/C]ATGAC) was paired with a PN4 specific primer
from the domain II region of pBK-CMV/PN4.10-1 (CAGCAGCTACAGTGGCTACA).
These primers amplified a ca. 1.5kb fragment from rat brain and from rat DRG which was
shown by sequencing to represent much of the 5' end of PN4, thus verifying that the
primers would work for screening the library. The primers were then used to screen plate
lysates of the DRG cDNA library by PCR. Positive lysates were plated and individual
plaques picked and screened by PCR using the same primers. Positive clones were excised
in vivo into pBK-CMV using the ExAssist/XLOLR system. One of these, pBK-CMV/75-1.4,
was found to contain PN4 sequence from the 5' UTR to the interdomain I/II region, but
not to domain II, possibly due to rearrangement during the excision process. The position
of the pBK-CMV/75-1.4 fragment in the PN4 and PN4a cloning map is shown in Fig. 6.
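
The bracket notation in the degenerate domain I primer stands for a mixture of oligonucleotides. As a minimal sketch, the mixture can be enumerated by expanding each bracketed choice; the function name is hypothetical and the parsing assumes only the `[X/Y]` notation used in the text.

```python
import re
from itertools import product

DEGENERATE_PRIMER = "ACCAACTG[T/C]GT[G/A]TT[T/C]ATGAC"


def expand_degenerate(primer: str) -> list:
    """Expand bracketed alternatives such as [T/C] into every concrete sequence."""
    # Split the primer into fixed stretches and bracketed choice groups.
    tokens = re.findall(r"\[[^\]]+\]|[ACGT]+", primer)
    choices = [t[1:-1].split("/") if t.startswith("[") else [t] for t in tokens]
    return ["".join(combo) for combo in product(*choices)]


variants = expand_degenerate(DEGENERATE_PRIMER)
```

Three two-fold degenerate positions give 2 x 2 x 2 = 8 distinct 20-base sequences.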

EXAMPLE 3: Cloning the interdomain I/II region

The gap between pBK-CMV/75-1.4 and pBK-CMV/PN4.10-1 was cloned by RT-PCR
on rat DRG and brain total RNA using specific primers:
AAAGAGGCCGAGTTCAAGGC (a base pair sequence of pBK-CMV/75-1.4) and
TGTCCTTCCGTCCGTAGG (a base pair sequence of pBK-CMV/PN4.10-1). PCR
products were cloned into plasmid pCRII and sequenced. Two distinct sequences, FA-2
and FA-7 (see Fig. 6), were cloned from DRG. These were found to be identical except for
the presence of a 30 base pair insert (found at base pairs 2014-2043 in SEQ ID NO:2, and
depicted by an upside-down triangle at the position of insertion in FA-7, FJ-13 and PN4a in Fig.
6), with sequence identity to pBK-CMV/75-1.4 and pBK-CMV/PN4.10-1 in the regions
where they overlap. RT-PCR on rat brain RNA yielded only clones which lacked the 30
base pair insert. This insert is homologous to a splice variant of the NaChI channel
(NaChIa) and likely results from alternative 5' splice site usage (Schaller et al., J. Neurosci.
12, 1370-1381 (1992)).

Additional RT-PCR was performed on rat DRG RNA using primers
TTCATGGGGAACCTTCGAAAC (a base pair sequence of pBK-CMV/75-1.4) and
GAACGATGCAGATGGTGATGGCTAA (a base pair sequence of pBK-CMV/PN4.10-1).
The 1.5kb PCR product was cloned into pCRII; six out of twenty isolates were
positive for the 30 base pair insert variant by PCR. The sequence obtained for one of
these, FJ-13, position shown in Fig. 6, was identical to that expected from the sequences of
pBK-CMV/75-1.4, FA-7, and pBK-CMV/PN4.10-1, thus confirming that these clones all
originated from the same transcript.

EXAMPLE 4: Assembly of full-length PN4 clones in expression vectors

Unsuccessful attempts have been made to create and stabilize full-length sodium
channel cDNA sequences. In US Patent No. 5,380,836, the cDNA sequence for a rat
cardiac sodium channel protein was contained in three separate plasmids. In order to create
full-length functional PN4 genes, the 5' end was modified: suitable restriction sites were
added and the upstream out-of-frame initiation codon was removed. The modified
pBK-CMV/75-1.4 and FA-2 sequences were fused together, then combined with the remaining
portion of PN4 from pBK-CMV/PN4.10-1 in suitable expression vectors. PCR was
employed to assemble the 5' portion of PN4 from the initiation codon to domain II. A
1.43kb PCR fragment was generated from pBK-CMV/75-1.4 using the following primers:
(1) GAAGCTCGAGCCCGGGCAAGAGAAGATGGCAGCGCGG (Xho-I Srf-I restriction
sites underlined, initiation codon in bold, PN4 homology in italics, a base pair sequence of
pBK-CMV/75-1.4) and primer (2) CTCGGAGAGCCTACCCCATC (a base pair
sequence of pBK-CMV/75-1.4, and a base pair sequence of FA-2). A 0.69kb PCR
fragment was generated from FA-2 using primer (3) AGAAGGGGAAGATGGGGTAGG
(a base pair sequence of FA-2, and a base pair sequence of pBK-CMV/75-1.4) and primer
(4) ATTCTGTCCTTCCGTCCGTAG (a base pair sequence of FA-2, and a base pair
sequence of pBK-CMV/PN4.10-1). These fragments were gel purified and then a small
fraction of each was combined as template in a further PCR reaction using primers (1) and
(4). The fragments share a 31 base pair region of overlap at their 3' and 5' ends
respectively, and therefore can act as primers to fuse the two fragments together (Horton et
al., Gene 77, 61-68 (1991)). The 2.1kb PCR product was cloned into pCRII and several
isolates were sequenced, one of which, FD-8, had the expected sequence. The position of
FD-8 in the PN4 and PN4a clones is shown in the cloning map of Fig. 6.
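
The size of the fused product follows from the two fragment sizes and their shared overlap. The sketch below is a rough consistency check only: the fragment sizes are the approximate values quoted in the text converted to base pairs, and the function name is illustrative.

```python
def fused_length(fragment_a_bp: int, fragment_b_bp: int, overlap_bp: int) -> int:
    """Length of the product when two fragments are joined through a shared
    overlap, counting the overlapping region once."""
    return fragment_a_bp + fragment_b_bp - overlap_bp


# Approximate sizes from the text: 1.43 kb and 0.69 kb fragments, 31 bp overlap.
product_bp = fused_length(1430, 690, 31)
product_kb = round(product_bp / 1000, 1)
```

Under these approximations the fused fragment comes to roughly 2.1 kb, consistent with the product size reported above.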

To facilitate cloning into pBSTA and pCI-neo, it was determined to introduce an
XbaI site at the 3' end. To accomplish this, the PN4 domain II to 3' UTR region was
subcloned from pBK-CMV/PN4.10-1 from the EcoRI site of the vector to the HindIII site
14 base pairs from the PN4 stop codon into EcoRI plus HindIII digested Litmus 29. The
resulting clone was labeled FC-1. The position of FC-1 in the PN4 and PN4a clones is
shown in the cloning map of Fig. 6.

To assemble the full length PN4, the 5' portion was subcloned from FD-8 as a
2.0kb Xho I-Eco NI fragment together with the 3' portion from FC-1 as a 4.0kb
Eco NI-Xba-I fragment into Xho-I plus Xba-I digested pBSTAcIIr. One of the resulting isolates
was found to have the correct sequence and was named pBSTAcIIr_PN4(FU-7A).

The splice variant, PN4a, was assembled by replacing the 1.3kb Sph I - Acc I region
of pBSTAcIIr_PN4(FU-7A) with the corresponding fragment from FJ-13, to form
pBSTAcIIr_PN4a(FZA-3), and confirmed by DNA sequencing.

PN4 and PN4a were recloned into pCI-neo as 6.0kb Xho-I to Xba-I fragments to
form pCI-neo-PN4(GAII-1) and pCI-neo-PN4a(GCII-2), respectively, and confirmed by
DNA sequencing. The sequences of the coding regions as cloned in the oocyte and
mammalian cell expression vectors of PN4 and PN4a are SEQ ID NO:1 and SEQ ID NO:2,
respectively.

Growth of fragments of PN4 or PN4a was accomplished under standard conditions;
however, growth of plasmids containing full length constructs of PN4 and PN4a (in pCI-neo
or pBSTAcIIr) could not be accomplished without use of special growth media, conditions,
and E. coli strains. The following proved to be optimal: (1) use of E. coli STBL2™ for
primary transformation following ligation reactions; for large scale culturing the primary
transformants in STBL2™ cells were used, but secondary transformants in SURE® cells
were used later if necessary. These E. coli strains have altered genotypes which allow the
stable propagation of plasmids containing unstable inserts. (2) Solid media was 1/2x FM
(see below) plus either 1x YENB (Bacto Yeast Extract, 0.75%, Bacto Nutrient Broth, 0.8%;
Sharma and Schimke, Biotechniques 20, 42-44 (1996)), 1x YET (Bacto Yeast Extract,
0.75%, Bacto Tryptone, 0.8%), or 1x LB (Tryptone, 1%, Yeast Extract, 0.5%, NaCl,
0.5%), plus 15g/L agar. (3) Liquid media optimally was 1x FM plus 1/2x LB. (4)
Carbenicillin, 100 µg/ml, was used for all media, as it is metabolized less rapidly than
ampicillin. However, carbenicillin may be used within the range of 50-200 µg/ml, and more
preferably within the range of 75-125 µg/ml. (5) Temperature for growth should be no
greater than 30°C, usually 28°C; this necessitated longer growth periods than normally
employed, from 36 to 48 hours.

The recipe for 2x Freezing Medium (2xFM) is K2HPO4, 12.6g; Na3Citrate, 0.9g;
MgSO4.7H2O, 0.18g; (NH4)2SO4, 1.8g; KH2PO4, 3.6g; Glycerol, 88g; H2O, qs to 1L.

2xFM and the remaining media components are prepared separately, sterilized by
autoclaving, cooled to at least 60°C, and added together to form the final medium.
Carbenicillin is prepared at 25mg/ml H2O and sterilized by filtration. 2xFM was first
described for preparation of frozen stocks of bacterial cells (Practical Methods in
Molecular Biology, Schleif, R.F. and Wensink, P.C., Springer-Verlag, New York (1981)
pp. 201-202).
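
The media described above are specified as fractions of the 2xFM recipe together with a carbenicillin stock, so preparing a batch involves two small calculations. The sketch below works them through; the function names are hypothetical, and it assumes simple proportional dilution of the 2xFM recipe.

```python
# Grams per litre of 2xFM, as listed in the recipe above.
FM_2X_G_PER_L = {
    "K2HPO4": 12.6,
    "Na3Citrate": 0.9,
    "MgSO4.7H2O": 0.18,
    "(NH4)2SO4": 1.8,
    "KH2PO4": 3.6,
    "Glycerol": 88.0,
}


def fm_at_strength(strength_x: float, volume_l: float = 1.0) -> dict:
    """Grams of each FM component present in `volume_l` litres of medium
    at the given strength (1.0 for 1x FM, 0.5 for 1/2x FM)."""
    factor = strength_x / 2.0 * volume_l
    return {name: grams * factor for name, grams in FM_2X_G_PER_L.items()}


def stock_volume_ml(final_ug_per_ml: float, stock_mg_per_ml: float,
                    final_volume_ml: float) -> float:
    """Millilitres of antibiotic stock needed to reach the final concentration."""
    return final_ug_per_ml * final_volume_ml / (stock_mg_per_ml * 1000.0)
```

For example, one litre of medium at 100 µg/ml carbenicillin requires 4 ml of the 25 mg/ml stock, and 1x FM contains 44 g of glycerol per litre.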

EXAMPLE 5: Tissue distribution by RT-PCR

Brain, spinal cord, DRG, nodose ganglia, superior cervical ganglia, sciatic nerve,
heart and skeletal muscle tissue were isolated from anesthetized, normal adult male
Sprague-Dawley rats and were stored at -80°C. RNA was isolated from each tissue using
RNAzol (Tel-Test, Inc.). Random-primed cDNA was reverse transcribed from 500ng of
RNA from each tissue. The CNaD4-2 specific primers
ACACTCAGAGCAAGCAGATGG and TCCCTGGGTGCTCTTTGTCCA (see above)
defined a 558 base pair amplicon and would not discriminate between PN4 and PN4a.
Thermal cycler parameters were 30 s/94°C, 30 s/64°C, 1 min/72°C (24 cycles (confirmation
experiment: 34 cycles)); 30 s/94°C, 30 s/64°C, 5 min/72°C (1 cycle). A positive control
(pCRII/CNaD4-2) and a no-template control were also included. cDNA from each tissue
was also PCR amplified using primers specific for glyceraldehyde-3-phosphate
dehydrogenase to demonstrate template viability, as described by Tso et al., Nucleic Acids
Res. 13, 2485-2502 (1985).

Tissue distribution profile of PN4 by analysis of RNA from selected rat tissues by
RT-PCR was as follows:

Tissue                      RT-PCR (35 cycles)
Brain                       +++++
Spinal cord                 +++
DRG                         ++
Nodose ganglia              ++
Superior cervical ganglia   +
Sciatic nerve               -
Heart                       -
Skeletal muscle             -

PN4 was also detected after only 25 cycles (24 + 1) in the same five tissues as
above in the same relative abundance.



Since PN4 differs from PN4a by only 30 base pairs, a new sense primer,
GGTGGACTGCAACGGCGTA (corresponding to the same base pair sequences of FA-2
and FA-7), was employed. RT-PCR using this primer together with primer
ATTCTGTCCTTCCGTCCGTAG (primer 4 above) gave amplicons of 159 base pairs
from PN4 and 189 base pairs from PN4a. Thermal cycler parameters were 1 min/95°C;
20 sec/94°C, 30 sec/60°C, 1 min/72°C, 8 cycles; 20 sec/94°C, 30 sec/58°C, 1 min/72°C, 27
cycles; 3 min/72°C. PN4a was nearly as abundant as PN4 in DRG, much less abundant
than PN4 in spinal cord, and almost undetectable in brain. This correlates well with cloning
data; based on sequenced, cloned RT-PCR fragments which included the 30 base pair insert
region, PN4a was found in approximately 40% of isolates from DRG (9/24), but not found from brain
(0/4).

Tissue
Brain
Spinal cord
DRG
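
Because the two transcripts are distinguished only by amplicon size, assigning a band to PN4 or PN4a amounts to comparing its measured length with the two expected sizes. The sketch below illustrates this; the tolerance value and function names are hypothetical choices made for the example, not parameters taken from the text.

```python
PN4_AMPLICON_BP = 159    # expected product from PN4
PN4A_AMPLICON_BP = 189   # expected product from PN4a (contains the 30 bp insert)


def call_variant(band_bp: float, tolerance_bp: float = 5) -> str:
    """Assign a measured band size to PN4 or PN4a within a size tolerance."""
    if abs(band_bp - PN4_AMPLICON_BP) <= tolerance_bp:
        return "PN4"
    if abs(band_bp - PN4A_AMPLICON_BP) <= tolerance_bp:
        return "PN4a"
    return "unassigned"


def fraction_pct(positive: int, total: int) -> float:
    """Percentage of isolates carrying the insert."""
    return 100.0 * positive / total
```

The two expected products differ by exactly the 30 base pair insert, and the 9 of 24 DRG isolates reported above correspond to 37.5%.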

EXAMPLE 6: Northern Analysis of mRNA from rat DRG

Lumbar DRG #4 and #5 (L4 and L5), brain and spinal cord were removed from
anesthetized adult male Sprague-Dawley rats under a dissecting microscope. The tissues
were frozen in dry ice and homogenized with a Polytron homogenizer; the RNA was
extracted by the guanidine isothiocyanate procedure (Chomczynski et al., Anal.
Biochemistry 162, 156-159 (1987)). Total RNA (5 µg of each sample) was dissolved in
MEN buffer containing 50% formamide, 6.6% formaldehyde and denatured at 65°C for 5-10
minutes. The RNA was electrophoresed through a 0.8% agarose gel containing 8.3%
formaldehyde in MEN buffer. The electrode buffer was MEN buffer containing 3.7%
formaldehyde; the gel was run at 50 V for 12-18 hours.

After electrophoresis, the gel was rinsed in 2xSSC and the RNA was transferred to a
Duralose membrane (Stratagene) with 20xSSC by capillary action; the membrane was baked
under vacuum at 80°C for 1 hour. The membrane was prehybridized in 50% formamide,
5xSSC, 50 mM sodium phosphate, pH 7.1, 1x Denhardt's solution, 0.5% SDS, and sheared,
heat-denatured salmon sperm DNA (1 mg/ml) for 16 hours at 42°C. The membrane was
hybridized in 50% formamide, 5xSSC, 50 mM sodium phosphate, pH 7.1, 1x Denhardt's
solution, 0.5% SDS, and sheared, heat-denatured salmon sperm DNA (200 µg/ml) with a
32P-labeled cRNA probe (ca. 1-3x10^6 cpm/ml). The probe was the cloned fragment,
CNaD4-2, which contains the Domain 4 sequence of the PN4 sodium channel α-subunit.
The probe was hybridized for 18 hours at 42°C. The cRNA probe was
synthesized by excising and subcloning the fragment into pBluescript KS+ vector,
purchased from Stratagene. The cRNA was transcribed in vitro using T3 RNA
polymerase, purchased from Promega, after linearizing the plasmid with XbaI, purchased
from Boehringer Mannheim. Protocols for each procedure mentioned above can be found in
Molecular Cloning: A Laboratory Manual by Sambrook et al. (Cold Spring Harbor
Laboratory Press, 2nd edition, 1989).

The membrane was washed three times with 2xSSC, 0.1% SDS at room temperature
for 20 minutes and then washed once with 0.1xSSC, 0.1% SDS at 68°C for 30 minutes.
The filter was exposed against Kodak X-omat AR film at -80°C with intensifying screens
for up to two weeks.

Size markers, including ribosomal 18S and 28S RNAs and RNA markers (GIBCO
BRL), were run in parallel lanes of the gel. Their positions were determined by staining the
excised lane with ethidium bromide (0.5 µg/ml) followed by photography under UV light.
The CNaD4-2 probe hybridized to RNA from the brain, cerebellum, dorsal and ventral
horn of the spinal cord with sizes of 11 kb, 9.5 kb, 7.5 kb, and 6.5 kb, estimated on the
basis of their positions relative to the standards.

Bands of the same size were detected in a blot containing total RNA from DRG
from a neuropathic pain model. However, no signal was detected with RNA from naive
DRG.

PN4 constitutes a subfamily of novel sodium channel genes; these genes are
different from those detectable with other probes (e.g., PEAF8 and PN3 probes), as
discussed in copending application no. 08/511,828. Sequence comparison of PN4 with
NaCh6 (mRNA size = 9.5kb) (Schaller et al., J. Neurosci. 15, 3231-3242 (1995)) and a
cardiac-specific sodium channel for which only a partial sequence is available (mRNA size
= 7kb) (Sills et al., J. Clin. Invest. 84, 331-336 (1989)) indicates that these genes share a
higher homology among themselves than with members of other sodium channel subfamilies
such as the brain-type sodium channels, the TTX-insensitive cardiac sodium channel and the
TTX-resistant PN3 sodium channel.

Semiquantitation of the signal intensity of the various bands detected in the blot
containing RNAs from the neuropathic pain model indicated that the level of the 7.5kb
transcript was upregulated ~35-fold as compared with the DRG from the sham operated
side on day 1 after the surgery, wherein the sciatic nerve was ligated with four loose
ligatures causing a constriction injury. None of the other transcripts detected by the
CNaD4-2 probe was regulated so dramatically. By day 2, the regulation was reduced to ~5-fold
as compared with the sham operated side. The experiment was performed with DRG
pooled from 6 rats.

These experimental data suggest that PN4, or its splice variant, PN4a, is involved in
the pathophysiology of neuropathic pain.

EXAMPLE 7: Expression of full length clone in Xenopus oocytes 
cRNA was prepared from PN4 subcloned into pBSTA using a T7 in vitro 
transcription kit (Ambion, mMessage mMachine) and was injected into stage V and VI 
Xenopus oocytes using a Nanojector (Drummond), as described in Goldin, supra. After 1.5 
days at 20°C, the oocytes were impaled with agarose-cushion electrodes (0.3-0.8 MOhm) 
and voltage-clamped with a Geneclamp 500 amplifier (Axon Instruments) in TEV mode. 
See Schreibmayer et al., Pflugers Arch. 426, 453-458 (1994). 

Stimulation and recording were controlled by a computer running pClamp (Axon 
Instruments) (Kegel et al., J. Neurosci. Meth. 12, 317-330 (1982)). Oocytes were 
perfused with a solution containing: 81 mM NaCl, 2 mM KCl, 1 mM MgCl2, 0.3 mM 
CaCl2, 20 mM Hepes-NaOH, pH 7.5. The data collected are shown in Figs. 7-9 and 
described below. 

Fig. 7a shows the currents produced from a PN4 sodium channel expressed in a 
Xenopus oocyte using the Geneclamp P/-4 leak subtraction, filtered at 5 kHz with a 4-pole 
Bessel filter, and sampled at 50 kHz. Test pulses for TTX c/r were to 0 mV for 5 ms at 
0.033 Hz. The x-axis denotes time in milliseconds. 

Fig. 7b illustrates the voltage to current relationship of a PN4 sodium channel 
expressed in a Xenopus oocyte. 


Fig. 8a and b show steady state inactivation of sodium currents produced by PN4 in 
Xenopus oocytes. In Fig. 8a the x-axis denotes time in milliseconds. In Fig. 8b the x-axis is 
conditioning potential in millivolts and the y-axis is current in µA. 

Fig. 9 demonstrates the effects of the β1 and β2 subunits upon PN4 function in 
Xenopus oocytes. Shown are currents produced when the PN4 α-subunit is injected (a) 
alone; (b) with the β1 subunit; (c) with the β2 subunit; and (d) with both the β1 and β2 
subunits. The x-axis in each of these figures denotes time in milliseconds. As these figures 
show, the inactivation kinetics of a functionally active PN4 channel are accelerated by the 
β1 subunit. No obvious effects are seen with the β2 subunit. 

As is seen in Fig. 7a and b, expression of PN4 and PN4a produced an inward current 
with slow inactivation kinetics, similar to that of the rBIIa (Patton et al., Neuron 7, 
637-647 (1991)) and rSkM1 α-subunits expressed in the absence of the β1-subunit. In the 
expression of PN4, 0.2 ng of cRNA gave 1.4 ± 0.19 µA (n = 9). In the expression of PN4a, 
0.1 ng gave 1.8 ± 0.23 µA (n = 6). Co-injection of rCNβ1 (1 ng/oocyte) with PN4 cRNA 
accelerated inactivation kinetics of the channel, as seen in Fig. 9a and b. This agreed with 
data obtained with rBIIa and hSCNβ1. 

For steady state inactivation, 10 second prepulses were used. In the steady state 
inactivation of PN4, V1/2 = -70.7 ± 0.71 mV, k = 5.5 ± 0.55 mV (n = 3). In the steady 
state inactivation of PN4a, V1/2 = -73.3 ± 0.97 mV, k = 5.5 ± 0.28 mV (n = 4). Leak 
currents were measured during long pulses to -100 mV and -120 mV, and the test currents 
were corrected assuming that the leak currents had a linear current-voltage relationship. A 
half-inactivation potential near -70 mV is similar to that of most sodium channels. 
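
The linear leak correction described above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the function names and the numeric leak currents are hypothetical and do not come from the recordings, and the actual correction procedure used with pClamp may have differed in detail.

```python
def fit_linear_leak(v1_mv, i1_ua, v2_mv, i2_ua):
    """Fit a straight line I = g * V + offset through two leak measurements.

    Returns (g, offset), with g in uA/mV and offset in uA.
    """
    if v1_mv == v2_mv:
        raise ValueError("the two leak potentials must differ")
    g = (i2_ua - i1_ua) / (v2_mv - v1_mv)
    offset = i1_ua - g * v1_mv
    return g, offset


def leak_corrected(i_test_ua, v_test_mv, g, offset):
    """Subtract the extrapolated linear leak from a test current."""
    return i_test_ua - (g * v_test_mv + offset)


# Hypothetical leak currents at the two potentials named in the text.
g, offset = fit_linear_leak(-100.0, -0.10, -120.0, -0.12)
# Hypothetical test current of -1.50 uA recorded at -10 mV.
corrected = leak_corrected(-1.50, -10.0, g, offset)
```

With these made-up values the extrapolated leak at -10 mV is small relative to the test current, which is the situation in which a linear correction is least sensitive to the linearity assumption.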



TEVC activation data for PN4a were V1/2 = -23 ± 2.7 mV, k = -6.4 ± 0.75 mV (n = 3). 
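
The V1/2 and k values reported for inactivation and activation are the usual parameters of a Boltzmann function. The sketch below assumes the common form 1 / (1 + exp((V - V1/2) / k)), in which a positive k gives a curve that falls with depolarization (inactivation) and a negative k gives one that rises (activation). The specification does not state the exact fitting equation, so this form and the function name are assumptions.

```python
import math


def boltzmann(v_mv, v_half_mv, k_mv):
    """Fraction of channels available (k > 0) or activated (k < 0) at v_mv.

    Assumed form: 1 / (1 + exp((V - V_half) / k)).
    """
    if k_mv == 0:
        raise ValueError("slope factor k must be non-zero")
    return 1.0 / (1.0 + math.exp((v_mv - v_half_mv) / k_mv))


# Steady state inactivation of PN4 (V1/2 = -70.7 mV, k = 5.5 mV).
available_at_rest = boltzmann(-120.0, -70.7, 5.5)   # close to 1
available_at_half = boltzmann(-70.7, -70.7, 5.5)    # exactly 0.5

# Activation of PN4a (V1/2 = -23 mV, k = -6.4 mV).
activated_at_0mv = boltzmann(0.0, -23.0, -6.4)      # most channels activated
```

Using a single function with a signed slope factor matches the sign convention in the reported values (k positive for inactivation, negative for activation).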



The TTX IC50 was found to be 1.0 ± 0.60 nM (n = 2). 



Sodium channels are distinctively sensitive or insensitive to neurotoxins such as 
TTX. The TTX-sensitive brain and skeletal muscle sodium channels are blocked by 
nanomolar TTX concentrations, whereas the TTX-insensitive cardiac sodium channels are 
blocked by micromolar TTX concentrations. In rat heart sodium channel 1 (rH1), Cys 374 is 
a critical determinant of TTX-insensitivity, as shown in Satin et al., Science 256, 
1202-1205 (1992); in the TTX-sensitive rBI, rBII, rBIII, and rSkM1, the corresponding 
residue is either Phe or Tyr. When expressed in Xenopus oocytes, the PN4 sodium current is 
sensitive to TTX (IC50 ≈ 1 nM). 
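
To make the nanomolar versus micromolar distinction concrete, the following Python sketch computes the fraction of current remaining at a given TTX concentration from an IC50. It assumes a simple one-site block (Hill coefficient of 1), which the specification does not state; the function name and the 1 µM comparison value for a TTX-insensitive channel are illustrative assumptions.

```python
def fraction_unblocked(conc_nm, ic50_nm, hill=1.0):
    """Fraction of sodium current remaining at a toxin concentration.

    Assumes a Hill-type block: 1 / (1 + (C / IC50) ** hill).
    """
    if conc_nm < 0 or ic50_nm <= 0:
        raise ValueError("concentration must be >= 0 and IC50 must be > 0")
    return 1.0 / (1.0 + (conc_nm / ic50_nm) ** hill)


# PN4, using the measured IC50 of about 1 nM.
pn4_at_10nm = fraction_unblocked(10.0, 1.0)

# A hypothetical TTX-insensitive channel with an IC50 of 1 uM (1000 nM).
insensitive_at_10nm = fraction_unblocked(10.0, 1000.0)
```

Under these assumptions, 10 nM TTX leaves about one eleventh of the PN4 current but almost all of the current through the hypothetical micromolar-IC50 channel.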



All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or patent 
application was specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity and understanding, it will be obvious that 
certain changes and modifications may be practiced within the scope of the appended 
claims. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 
(A) NAME: F. HOFFMANN-LA ROCHE AG 
(B) STREET: Grenzacherstrasse 124 
(C) CITY: Basle 
(D) STATE: BS 
(E) COUNTRY: Switzerland 
(F) POSTAL CODE (ZIP): CH-4002 
(G) TELEPHONE: 061 - 688 42 56 
(H) TELEFAX: 061 - 688 13 95 
(I) TELEX: 962292/965542 hlr ch 

(ii) TITLE OF INVENTION: Novel Cloned Tetrodotoxin-Sensitive Sodium 
Channel α-Subunit And A Splice Variant Thereof 

(iii) NUMBER OF SEQUENCES: 7 

(iv) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Floppy disk 
(B) COMPUTER: Apple Macintosh 
(C) OPERATING SYSTEM: System 7.1 (Macintosh) 
(D) SOFTWARE: Word 5.0 

(2) INFORMATION FOR SEQ ID NO:1: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5977 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: rat 
(F) TISSUE TYPE: Dorsal root ganglia 
(G) CELL TYPE: Peripheral nerve 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

1 CTCGAGCCCG GGCAAGAGAA GATGGCAGCG CGGCTGCTCG CACCACCAGG CCCTGATAGT 
61 TTCAAGCCTT TCACCCCTGA GTCGCTGGCA AACATCGAGA GGCGTATTGC CGAGAGCAAG 
121 CTCAAGAAAC CACCAAAGGC GGATGGCAGC CACCGGGAGG ACGATGAAGA CAGCAAGCCC 
181 AAGCCAAACA GTGACCTGGA GGCTGGGAAG AGTTTGCCTT TCATCTACGG GGACATCCCG 
241 CAAGGCCTGG TTGCGGTTCC CCTGGAGGAC TTTGACCCTT ACTATTTGAC GCAGAAAACC 
301 TTTGTAGTAT TAAACAGAGG GAAAACTCTC TTCAGATTTA GTGCCACACC TGCCTTGTAC 
361 ATTTTAAGCC CTTTTAACCT GATAAGAAGA ATAGCTATTA AAATTTTGAT ACACTCAGTT 
421 TTCAGCATGA TCATCATGTG CACCATCCTG ACCAACTGTG TGTTCATGAC CTTTAGTAAC 
481 CCTCCAGAAT GGTCCAAGAA TGTGGAGTAC ACATTCACAG GGATTTACAC ATTTGAATCA 
541 CTAGTGAAAA TCATCGCAAG AGGTTTCTGC ATAGACGGCT TCACCTTCTT GCGAGACCCG 
601 TGGAACTGGT TAGACTTCAG TGTCATCATG ATGGCATATG TGACAGAGTT TGTGGACCTG 
661 GGCAATGTCT CAGCGCTGAG AACATTCAGG GTTCTCCGAG CTTTGAAAAC TATCTCTGTA 
721 ATTCCAGGCC TGAAGACAAT CGTGGGCGCC CTAATCCAGT CCGTGAAGAA GCTGTCGGAC 
781 GTGATGATCC TGACAGTGTT CTGCCTGAGT GTTTTCGCCC TGATTGGCCT GCAGCTCTTC 
841 ATGGGGAACC TTCGAAACAA GTGTGTCGTG TGGCCCATAA ACTTCAACGA GAGCTACCTG 
901 GAGAACGGCA CCAGAGGCTT TGACTGGGAG GAATATATCA ACAATAAAAC AAACTTTTAC 
961 ATGGTTCCTG GCATGCTAGA ACCCTTGCTC TGCGGGAACA GTTCTGATGC TGGGCAATGC 
1021 CCAGAGGGAT TCCAGTGCAT GAAAGCAGGA AGGAACCCCA ACTACGGTTA CACCAGCTTT 
1081 GACACCTTCA GCTGGGCCTT CTTGGCATTA TTCCGCCTTA TGACCCAGGA CTATTGGGAG 
1141 AACTTATACC AGCTGACCTT ACGAGCCGCT GGGAAAACGT ACATGATCTT CTTTGTCTTG 
1201 GTCATCTTCG TGGGTTCTTT CTATCTGGTG AACTTGATCT TGGCTGTGGT GGCCATGGCT 
1261 TATGAGGAAC AGAACCAGGC AACACTGGAG GAGGCAGAGC AAAAAGAGGC CGAGTTCAAG 
1321 GCAATGCTGG AGCAACTCAA GAAGCAGCAG GAGGAGGCAC AGGCTGCTGC AATGGCCACC 
1381 TCAGCGGGCA CTGTCTCGGA AGACGCCATT GAAGAAGAAG GGGAAGATGG GGTAGGCTCT 
1441 CCGAGGAGCT CTTCTGAACT GTCTAAACTC AGTTCCAAGA GCGCGAAGGA GCGGCGGAAC 
1501 CGACGGAAGA AGAGGAAGCA GAAGGAGCTC TCTGAAGGCG AGGAGAAAGG GGACCCGGAG 
1561 AAGGTGTTTA AGTCAGAGTC GGAAGACGGT ATGAGAAGGA AGGCCTTCCG GCTGCCAGAC 
1621 AACAGGATAG GGAGGAAGTT TTCCATCATG AATCAGTCGC TGCTCAGCAT TCCAGGCTCG 
1681 CCCTTCCTCT CCCGACATAA CAGCAAAAGC AGCATCTTCA GCTTCCGGGG ACCCGGTCGG 
1741 TTCCGGGACC CCGGCTCTGA GAATGAGTTC GCAGACGATG AACACAGCAC CGTGGAGGAG 
1801 AGCGAGGGCC GGCGTGACTC GCTCTTCATC CCGATCCGCG CCCGCGAGCG CCGCAGCAGC 
1861 TACAGTGGCT ACAGCGGCTA CAGCCAGTGC AGCCGCTCGT CGCGCATCTT CCCCAGCCTG 
1921 CGGCGCAGCG TGAAGCGCAA CAGCACGGTG GACTGCAACG GCGTAGTGTC ACTCATCGGG 
1981 CCCGGCTCAC ACATCGGGCG GCTCCTGCCT GAGGCAACGA CTGAGGTGGA AATTAAGAAG 
2041 AAAGGCCCTG GATCTCTTTT AGTTTCTATG GACCAACTCG CCTCCTACGG ACGGAAGGAC 
2101 AGAATCAACA GCATAATGAG CGTGGTCACA AACACGCTAG TGGAAGAGCT GGAAGAGTCT 
2161 CAGAGAAAGT GCCCACCGTG CTGGTATAAG TTTGCCAACA CTTTCCTCAT CTGGGAGTGT 
2221 CACCCCTACT GGATAAAACT GAAGGAGATC GTGAACTTAA TCGTCATGGA CCCTTTTGTA 
2281 GACTTAGCCA TCACCATCTG CATCGTTCTG AATACGCTAT TTATGGCAAT GGAGCACCAT 
2341 CCCATGACAC CACAGTTCGA ACACGTCTTG GCCGTAGGAA ATCTGGTGTT CACCGGGATC 
2401 TTCACGGCGG AAATGTTTCT GAAGCTCATA GCCATGGACC CCTACTATTA TTTCCAAGAA 
2461 GGCTGGAACA TTTTTGACGG ATTTATTGTC TCCCTCAGTT TAATGGAGCT GAGTCTCGCA 
2521 GATGTGGAGG GGCTCTCAGT GCTGCGGTCT TTCCGACTGC TCCGAGTCTT CAAGCTGGCC 
2581 AAGTCCTGGC CCACCCTGAA CATGCTGATC AAGATCATCG GGAACTCCGT GGGTGCCCTG 
2641 GGCAACCTGA CCCTGGTGCT GGCCATCATC GTCTTCATCT TCGCCGTGGT GGGGATGCAG 
2701 CTGTTTGGAA AGAGTTACAA GGAGTGCGTC TGTAAGATCA ACCAGGAGTG CAAGCTCCCG 
2761 CGCTGGCACA TGAACGACTT CTTCCACTCC TTCCTCATCG TCTTCCGAGT GCTGTGTGGG 
2821 GAGTGGATCG AGACCATGTG GGACTGCATG GAGGTGGCCG GCCAGGCCAT GTGCCTCATT 
2881 GTCTTCATGA TGGTTATGGT CATTGGCAAC CTGGTGGTGC TGAATCTATT CCTGGCCTTG 
2941 CTTCTGAGCT CCTTCAGCGC AGACAACCTG GCGGCCACAG ACGACGACGG GGAAATGAAC 
3001 AACCTGCAGA TCTCAGTGAT CCGGATCAAG AAGGGCGTGG CCTGGACCAA AGTGAAGGTG 
3061 CACGCCTTCA TGCAGGCTCA CTTCAAGCAG CGGGAGGCGG ATGAAGTGAA ACCCCTCGAC 
3121 GAGCTGTATG AGAAGAAGGC CAACTGCATC GCCAACCACA CGGGCGTGGA TATCCACCGG 
3181 AACGGCGACT TCCAGAAGAA CGGGAACGGA ACCACCAGCG GCATCGGCAG CAGCGTGGAG 
3241 AAGTACATCA TCGACGAGGA CCACATGTCC TTCATTAACA ACCCAAACCT GACCGTCCGG 
3301 GTGCCCATTG CTGTGGGCGA GTCTGACTTC GAGAACCTCA ACACAGAGGA TGTTAGCAGC 
3361 GAATCAGACC CTGAAGGCAG CAAAGATAAA CTGGACGATA CCAGCTCCTC AGAAGGAAGT 
3421 ACCATCGACA TCAAGCCTGA GGTGGAAGAA GTTCCCGTGG AGCAACCTGA GGAATACTTG 
3481 GATCCGGACG CCTGCTTTAC AGAGGGTTGC GTCCAGCGGT TCAAGTGCTG CCAGGTCAAC 
3541 ATCGAGGAAG GACTAGGCAA GTCGTGGTGG ATCTTGCGGA AAACCTGCTT CCTCATTGTG 
3601 GAGCACAATT GGTTTGAGAC CTTCATCATC TTCATGATTC TGCTCAGCAG TGGCGCCCTG 
3661 GCCTTTGAGG ACATCTACAT TGAGCAGAGG AAGACCATCC GCACCATCCT GGAGTATGCG 
3721 GACAAGGTCT TCACCTACAT CTTCATCCTG GAGATGTTGC TCAAGTGGAC AGCCTACGGC 
3781 TTCGTCAAGT TCTTCACCAA TGCCTGGTGC TGGTTGGACT TCCTCATTGT GGCTGTCTCT 
3841 TTAGTCAGCC TTATAGCTAA TGCCCTGGGC TACTCGGAAC TAGGTGCCAT AAAGTCCCTT 
3901 AGGACCCTAA GAGCTTTGAG ACCCTTAAGA GCCTTATCAC GATTTGAAGG GATGAGGGTG 
3961 GTGGTGAATG CCTTGGTGGG CGCCATCCCC TCCATCATGA ATGTGCTGCT GGTGTGTCTC 
4021 ATCTTCTGGC TGATTTTCAG CATCATGGGA GTTAACCTGT TTGCGGGGAA ATACCACTAC 
4081 TGCTTTAATG AGACTTCTGA AATCCGGTTC GAAATCGATA TTGTCAACAA TAAAACGGAC 

4141 TGTGAGAAGC TCATGGAGGG CAACAGCACG GAGATCCGAT GGAAGAATGT CAAGATCAAC 
4201 TTTGACAATG TCGGAGCAGG GTACCTGGCC CTTCTTCAAG TGGCAACCTT CAAAGGCTGG 
4261 ATGGACATCA TGTATGCGGC TGTAGATTCC CGAAAGCCAG ACGAGCAGCC TGACTACGAG 
4321 GGCAACATCT ACATGTACAT CTACTTCGTC ATCTTCATCA TCTTCGGCTC CTTCTTCACC 
4381 CTCAACCTGT TCATCGGTGT CATCATCGAC AACTTCAACC AGCAGAAGAA AAAGTTTGGA 
4441 GGTCAGGACA TCTTCATGAC AGAGGAACAG AAGAAGTACT ACAATGCCAT GAAAAAGCTG 
4501 GGCTCCAAGA AGCCACAGAA GCCCATCCCC CGACCCTTGA ACAAAATCCA AGGGATTGTC 
4561 TTTGATTTCG TCACTCAACA AGCCTTTGAC ATTGTGATCA TGATGCTCAT CTGCCTTAAC 
4621 ATGGTGACAA TGATGGTGGA GACAGACACT CAGAGCAAGC AGATGGAGAA CATTCTTTAC 
4681 TGGATTAATC TGGTCTTTGT CATCTTCTTC ACCTGCGAGT GTGTGCTCAA AATGTTTGCC 
4741 TTGAGACACT ACTATTTCAC CATTGGCTGG AACATCTTTG ACTTTGTGGT GGTCATCCTC 
4801 TCCATTGTGG GAATGTTCCT GGCTGATATC ATTGAGAAGT ACTTCGTCTC CCCAACCCTA 
4861 TTCCGAGTTA TCCGATTGGC CCGTATTGGG CGCATCTTGC GTCTGATCAA GGGCGCCAAA 
4921 GGGATCCGCA CCCTGCTCTT TGCCTTAATG ATGTCGCTGC CCGCCCTGTT CAACATCGGC 
4981 CTCCTGCTCT TCCTCGTCAT GTTCATCTTC TCCATTTTTG GCATGTCCAA CTTCGCATAC 
5041 GTGAAGCACG AGGCCGGCAT TGACGACATG TTCAACTTCG AGACATTTGG CAACAGCATG 
5101 ATCTGTTTGT TCCAGATCAC AACGTCTGCT GGCTGGGATG GCCTGCTGCT GCCAATCCTG 
5161 AACCGCCCCC CTGACTGCAG CTTGGACAAA GAGCACCCAG GGAGTGGCTT CAAAGGGGAC 
5221 TGTGGGAACC CCTCGGTGGG CATCTTCTTC TTTGTGAGCT ACATCATCAT CTCCTTCCTG 
5281 ATTGTGGTGA ACATGTACAT CGCCATCATC CTGGAGAACT TCAGCGTGGC CACCGAGGAG 
5341 AGCGCCGACC CTCTGAGTGA GGATGACTTC GAGACTTTCT ATGAGATCTG GGAGAAGTTT 
5401 GACCCAGACG CCACCCAGTT CATCGAGTAC TGTAAGCTGG CAGACTTTGC CGACGCCCTG 
5461 GAGCACCCGC TCCGAGTACC CAAGCCCAAC ACCATCGAGC TCATCGCCAT GGACCTGCCC 
5521 ATGGTGAGCG GAGATCGCAT CCACTGCTTG GACATCCTTT TCGCCTTCAC CAAGCGAGTC 
5581 CTGGGAGACA GTGGGGAGTT GGACATCCTG CGGCAGCAGA TGGAGGAGCG GTTCGTGGCA 
5641 TCCAATCCTT CCAAAGTGTC TTACGAGCCT ATCACAACCA CTCTGCGGCG CAAGCAGGAG 
5701 GAGGTGTCTG CAGTGGTCCT GCAGCGTGCC TACAGGGGAC ACTTGGCTAG GCGGGGCTTC 
5761 ATCTGCAGAA AGATGGCCTC CAACAAGCTG GAGAATGGAG GCACACACAG AGACAAGAAG 
5821 GAGAGCACCC CGTCCACAGC CTCCCTCCCC TCTTACGACA GCGTCACAAA GCCAGACAAG 
5881 GAGAAGCAGC AGCGTGCGGA GGAGGGCAGA AGGGAAAGAG CCAAGAGGCA AAAAGAGGTC 
5941 AGGGAGTCCA AGTGCTAGAG GAGGGGAAAG GAAGCTT 

(3) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6007 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: rat 
(F) TISSUE TYPE: Dorsal root ganglia 
(G) CELL TYPE: Peripheral nerve 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

1 CTCGAGCCCG GGCAAGAGAA GATGGCAGCG CGGCTGCTCG CACCACCAGG CCCTGATAGT 
61 TTCAAGCCTT TCACCCCTGA GTCGCTGGCA AACATCGAGA GGCGTATTGC CGAGAGCAAG 
121 CTCAAGAAAC CACCAAAGGC GGATGGCAGC CACCGGGAGG ACGATGAAGA CAGCAAGCCC 
181 AAGCGAAACA GTGACCTGGA GGCTGGGAAG AGTTTGCCTT TCATCTACGG GGACATCCCG 
241 CAAGGCCTGG TTGCGGTTCC CCTGGAGGAC TTTGACCCTT ACTATTTGAC GCAGAAAACC 
301 TTTGTAGTAT TAAACAGAGG GAAAACTCTC TTCAGATTTA GTGCCACACC TGCCTTGTAC 
361 ATTTTAAGCC CTTTTAACCT GATAAGAAGA ATAGCTATTA AAATTTTGAT ACACTCAGTT 
421 TTCAGCATGA TCATCATGTG CACCATCCTG ACCAACTGTG TGTTCATGAC CTTTAGTAAC 
481 CCTCCAGAAT GGTCCAAGAA TGTGGAGTAC ACATTCACAG GGATTTACAC ATTTGAATCA 
541 CTAGTGAAAA TCATCGCAAG AGGTTTCTGC ATAGACGGCT TCACCTTCTT GCGAGACCCG 
601 TGGAACTGGT TAGACTTCAG TGTCATCATG ATGGCATATG TGACAGAGTT TGTGGACCTG 
661 GGCAATGTCT CAGCGCTGAG AACATTCAGG GTTCTCCGAG CTTTGAAAAC TATCTCTGTA 
721 ATTCCAGGCC TGAAGACAAT CGTGGGCGCC CTAATCCAGT CCGTGAAGAA GCTGTCGGAC 
781 GTGATGATCC TGACAGTGTT CTGCCTGAGT GTTTTCGCCC TGATTGGCCT GCAGCTCTTC 
841 ATGGGGAACC TTCGAAACAA GTGTGTCGTG TGGCCCATAA ACTTCAACGA GAGCTACCTG 
901 GAGAACGGCA CCAGAGGCTT TGACTGGGAG GAATATATCA ACAATAAAAC AAACTTTTAC 
961 ATGGTTCCTG GCATGCTAGA ACCCTTGCTC TGCGGGAACA GTTCTGATGC TGGGCAATGC 
1021 CCAGAGGGAT TCCAGTGCAT GAAAGCAGGA AGGAACCCCA ACTACGGTTA CACCAGCTTT 
1081 GACACCTTCA GCTGGGCCTT CTTGGCATTA TTCCGCCTTA TGACCCAGGA CTATTGGGAG 
1141 AACTTATACC AGCTGACCTT ACGAGCCGCT GGGAAAACGT ACATGATCTT CTTTGTCTTG 
1201 GTCATCTTCG TGGGTTCTTT CTATCTGGTG AACTTGATCT TGGCTGTGGT GGCCATGGCT 
1261 TATGAGGAAC AGAACCAGGC AACACTGGAG GAGGCAGAGC AAAAAGAGGC CGAGTTCAAG 
1321 GCAATGCTGG AGCAACTCAA GAAGCAGCAG GAGGAGGCAC AGGCTGCTGC AATGGCCACC 
1381 TCAGCGGGCA CTGTCTCGGA AGACGCCATT GAAGAAGAAG GGGAAGATGG GGTAGGCTCT 
1441 CCGAGGAGCT CTTCTGAACT GTCTAAACTC AGTTCCAAGA GCGCGAAGGA GCGGCGGAAC 
1501 CGACGGAAGA AGAGGAAGCA GAAGGAGCTC TCTGAAGGCG AGGAGAAAGG GGACCCGGAG 
1561 AAGGTGTTTA AGTCAGAGTC GGAAGACGGT ATGAGAAGGA AGGCCTTCCG GCTGCCAGAC 
1621 AACAGGATAG GGAGGAAGTT TTCCATCATG AATCAGTCGC TGCTCAGCAT TCCAGGCTCG 
1681 CCCTTCCTCT CCCGACATAA CAGCAAAAGC AGCATCTTCA GCTTCCGGGG ACCCGGTCGG 
1741 TTCCGGGACC CCGGCTCTGA GAATGAGTTC GCAGACGATG AACACAGCAC CGTGGAGGAG 
1801 AGCGAGGGCC GGCGTGACTC GCTCTTCATC CCGATCCGCG CCCGCGAGCG CCGCAGCAGC 
1861 TACAGTGGCT ACAGCGGCTA CAGCCAGTGC AGCCGCTCGT CGCGCATCTT CCCCAGCCTG 
1921 CGGCGCAGCG TGAAGCGCAA CAGCACGGTG GACTGCAACG GCGTAGTGTC ACTCATCGGG 
1981 CCCGGCTCAC ACATCGGGCG GCTCCTGCCT GAGGTGAAAA TAGATAAGGC AGCTACGGAC 
2041 AGCGCAACGA CTGAGGTGGA AATTAAGAAG AAAGGCCCTG GATCTCTTTT AGTTTCTATG 
2101 GACCAACTCG CCTCCTACGG ACGGAAGGAC AGAATCAACA GCATAATGAG CGTGGTCACA 
2161 AACACGCTAG TGGAAGAGCT GGAAGAGTCT CAGAGAAAGT GCCCACCGTG CTGGTATAAG 
2221 TTTGCCAACA CTTTCCTCAT CTGGGAGTGT CACCCCTACT GGATAAAACT GAAGGAGATC 
2281 GTGAACTTAA TCGTCATGGA CCCTTTTGTA GACTTAGCCA TCACCATCTG CATCGTTCTG 
2341 AATACGCTAT TTATGGCAAT GGAGCACCAT CCCATGACAC CACAGTTCGA ACACGTCTTG 
2401 GCCGTAGGAA ATCTGGTGTT CACCGGGATC TTCACGGCGG AAATGTTTCT GAAGCTCATA 
2461 GCCATGGACC CCTACTATTA TTTCCAAGAA GGCTGGAACA TTTTTGACGG ATTTATTGTC 
2521 TCCCTCAGTT TAATGGAGCT GAGTCTCGCA GATGTGGAGG GGCTCTCAGT GCTGCGGTCT 
2581 TTCCGACTGC TCCGAGTCTT CAAGCTGGCC AAGTCCTGGC CCACCCTGAA CATGCTGATC 
2641 AAGATCATCG GGAACTCCGT GGGTGCCCTG GGCAACCTGA CCCTGGTGCT GGCCATCATC 
2701 GTCTTCATCT TCGCCGTGGT GGGGATGCAG CTGTTTGGAA AGAGTTACAA GGAGTGCGTC 
2761 TGTAAGATCA ACCAGGAGTG CAAGCTCCCG CGCTGGCACA TGAACGACTT CTTCCACTCC 
2821 TTCCTCATCG TCTTCCGAGT GCTGTGTGGG GAGTGGATCG AGACCATGTG GGACTGCATG 
2881 GAGGTGGCCG GCCAGGCCAT GTGCCTCATT GTCTTCATGA TGGTTATGGT CATTGGCAAC 
2941 CTGGTGGTGC TGAATCTATT CCTGGCCTTG CTTCTGAGCT CCTTCAGCGC AGACAACCTG 
3001 GCGGCCACAG ACGACGACGG GGAAATGAAC AACCTGCAGA TCTCAGTGAT CCGGATCAAG 
3061 AAGGGCGTGG CCTGGACCAA AGTGAAGGTG CACGCCTTCA TGCAGGCTCA CTTCAAGCAG 
3121 CGGGAGGCGG ATGAAGTGAA ACCCCTCGAC GAGCTGTATG AGAAGAAGGC CAACTGCATC 
3181 GCCAACCACA CGGGCGTGGA TATCCACCGG AACGGCGACT TCCAGAAGAA CGGGAACGGA 
3241 ACCACCAGCG GCATCGGCAG CAGCGTGGAG AAGTACATCA TCGACGAGGA CCACATGTCC 
3301 TTCATTAACA ACCCAAACCT GACCGTCCGG GTGCCCATTG CTGTGGGCGA GTCTGACTTC 
3361 GAGAACCTCA ACACAGAGGA TGTTAGCAGC GAATCAGACC CTGAAGGCAG CAAAGATAAA 
3421 CTGGACGATA CCAGCTCCTC AGAAGGAAGT ACCATCGACA TCAAGCCTGA GGTGGAAGAA 
3481 GTTCCCGTGG AGCAACCTGA GGAATACTTG GATCCGGACG CCTGCTTTAC AGAGGGTTGC 
3541 GTCCAGCGGT TCAAGTGCTG CCAGGTCAAC ATCGAGGAAG GACTAGGCAA GTCGTGGTGG 
3601 ATCTTGCGGA AAACCTGCTT CCTCATTGTG GAGCACAATT GGTTTGAGAC CTTCATCATC 
3661 TTCATGATTC TGCTCAGCAG TGGCGCCCTG GCCTTTGAGG ACATCTACAT TGAGCAGAGG 
3721 AAGACCATCC GCACCATCCT GGAGTATGCG GACAAGGTCT TCACCTACAT CTTCATCCTG 
3781 GAGATGTTGC TCAAGTGGAC AGCCTACGGC TTCGTCAAGT TCTTCACCAA TGCCTGGTGC 
3841 TGGTTGGACT TCCTCATTGT GGCTGTCTCT TTAGTCAGCC TTATAGCTAA TGCCCTGGGC 
3901 TACTCGGAAC TAGGTGCCAT AAAGTCCCTT AGGACCCTAA GAGCTTTGAG ACCCTTAAGA 
3961 GCCTTATCAC GATTTGAAGG GATGAGGGTG GTGGTGAATG CCTTGGTGGG CGCCATCCCC 
4021 TCCATCATGA ATGTGCTGCT GGTGTGTCTC ATCTTCTGGC TGATTTTCAG CATCATGGGA 
4081 GTTAACCTGT TTGCGGGGAA ATACCACTAC TGCTTTAATG AGACTTCTGA AATCCGGTTC 
4141 GAAATCGATA TTGTCAACAA TAAAACGGAC TGTGAGAAGC TCATGGAGGG CAACAGCACG 
4201 GAGATCCGAT GGAAGAATGT CAAGATCAAC TTTGACAATG TCGGAGCAGG GTACCTGGCC 
4261 CTTCTTCAAG TGGCAACCTT CAAAGGCTGG ATGGACATCA TGTATGCGGC TGTAGATTCC 
4321 CGAAAGCCAG ACGAGCAGCC TGACTACGAG GGCAACATCT ACATGTACAT CTACTTCGTC 
4381 ATCTTCATCA TCTTCGGCTC CTTCTTCACC CTCAACCTGT TCATCGGTGT CATCATCGAC 
4441 AACTTCAACC AGCAGAAGAA AAAGTTTGGA GGTCAGGACA TCTTCATGAC AGAGGAACAG 
4501 AAGAAGTACT ACAATGCCAT GAAAAAGCTG GGCTCCAAGA AGCCACAGAA GCCCATCCCC 
4561 CGACCCTTGA ACAAAATCCA AGGGATTGTC TTTGATTTCG TCACTCAACA AGCCTTTGAC 
4621 ATTGTGATCA TGATGCTCAT CTGCCTTAAC ATGGTGACAA TGATGGTGGA GACAGACACT 
4681 CAGAGCAAGC AGATGGAGAA CATTCTTTAC TGGATTAATC TGGTCTTTGT CATCTTCTTC 
4741 ACCTGCGAGT GTGTGCTCAA AATGTTTGCC TTGAGACACT ACTATTTCAC CATTGGCTGG 
4801 AACATCTTTG ACTTTGTGGT GGTCATCCTC TCCATTGTGG GAATGTTCCT GGCTGATATC 
4861 ATTGAGAAGT ACTTCGTCTC CCCAACCCTA TTCCGAGTTA TCCGATTGGC CCGTATTGGG 
4921 CGCATCTTGC GTCTGATCAA GGGCGCCAAA GGGATCCGCA CCCTGCTCTT TGCCTTAATG 
4981 ATGTCGCTGC CCGCCCTGTT CAACATCGGC CTCCTGCTCT TCCTCGTCAT GTTCATCTTC 
5041 TCCATTTTTG GCATGTCCAA CTTCGCATAC GTGAAGCACG AGGCCGGCAT TGACGACATG 
5101 TTCAACTTCG AGACATTTGG CAACAGCATG ATCTGTTTGT TCCAGATCAC AACGTCTGCT 
5161 GGCTGGGATG GCCTGCTGCT GCCAATCCTG AACCGCCCCC CTGACTGCAG CTTGGACAAA 
5221 GAGCACCCAG GGAGTGGCTT CAAAGGGGAC TGTGGGAACC CCTCGGTGGG CATCTTCTTC 
5281 TTTGTGAGCT ACATCATCAT CTCCTTCCTG ATTGTGGTGA ACATGTACAT CGCCATCATC 
5341 CTGGAGAACT TCAGCGTGGC CACCGAGGAG AGCGCCGACC CTCTGAGTGA GGATGACTTC 
5401 GAGACTTTCT ATGAGATCTG GGAGAAGTTT GACCCAGACG CCACCCAGTT CATCGAGTAC 
5461 TGTAAGCTGG CAGACTTTGC CGACGCCCTG GAGCACCCGC TCCGAGTACC CAAGCCCAAC 
5521 ACCATCGAGC TCATCGCCAT GGACCTGCCC ATGGTGAGCG GAGATCGCAT CCACTGCTTG 
5581 GACATCCTTT TCGCCTTCAC CAAGCGAGTC CTGGGAGACA GTGGGGAGTT GGACATCCTG 
5641 CGGCAGCAGA TGGAGGAGCG GTTCGTGGCA TCCAATCCTT CCAAAGTGTC TTACGAGCCT 
5701 ATCACAACCA CTCTGCGGCG CAAGCAGGAG GAGGTGTCTG CAGTGGTCCT GCAGCGTGCC 
5761 TACAGGGGAC ACTTGGCTAG GCGGGGCTTC ATCTGCAGAA AGATGGCCTC CAACAAGCTG 
5821 GAGAATGGAG GCACACACAG AGACAAGAAG GAGAGCACCC CGTCCACAGC CTCCCTCCCC 
5881 TCTTACGACA GCGTCACAAA GCCAGACAAG GAGAAGCAGC AGCGTGCGGA GGAGGGCAGA 
5941 AGGGAAAGAG CCAAGAGGCA AAAAGAGGTC AGGGAGTCCA AGTGCTAGAG GAGGGGAAAG 
6001 GAAGCTT 

(4) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1978 amino acids 
(B) TYPE: amino acid 
(C) STRANDEDNESS: 
(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES (although, functionally expressed) 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: rat 
(F) TISSUE TYPE: dorsal root ganglia 
(G) CELL TYPE: peripheral nerve 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Ala Ala Arg Leu Leu Ala Pro Pro Gly Pro Asp Ser Phe Lys 
5 10 15 

Pro Phe Thr Pro Glu Ser Leu Ala Asn Ile Glu Arg Arg Ile Ala 
20 25 30 

Glu Ser Lys Leu Lys Lys Pro Pro Lys Ala Asp Gly Ser His Arg 
35 40 45 

Glu Asp Asp Glu Asp Ser Lys Pro Lys Pro Asn Ser Asp Leu Glu 
50 55 60 

Ala Gly Lys Ser Leu Pro Phe Ile Tyr Gly Asp Ile Pro Gln Gly 
65 70 75 

Leu Val Ala Val Pro Leu Glu Asp Phe Asp Pro Tyr Tyr Leu Thr 
80 85 90 

Gln Lys Thr Phe Val Val Leu Asn Arg Gly Lys Thr Leu Phe Arg 
95 100 105 

Phe Ser Ala Thr Pro Ala Leu Tyr Ile Leu Ser Pro Phe Asn Leu 
110 115 120 

Ile Arg Arg Ile Ala Ile Lys Ile Leu Ile His Ser Val Phe Ser 
125 130 135 

Met Ile Ile Met Cys Thr Ile Leu Thr Asn Cys Val Phe Met Thr 
140 145 150 

Phe Ser Asn Pro Pro Glu Trp Ser Lys Asn Val Glu Tyr Thr Phe 
155 160 165 

Thr Gly Ile Tyr Thr Phe Glu Ser Leu Val Lys Ile Ile Ala Arg 
170 175 180 

Gly Phe Cys Ile Asp Gly Phe Thr Phe Leu Arg Asp Pro Trp Asn 
185 190 195 

Trp Leu Asp Phe Ser Val Ile Met Met Ala Tyr Val Thr Glu Phe 
200 205 210 

Val Asp Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu 
215 220 225 

Arg Ala Leu Lys Thr Ile Ser Val Ile Pro Gly Leu Lys Thr Ile 
230 235 240 

Val Gly Ala Leu Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met 
245 250 255 

Ile Leu Thr Val Phe Cys Leu Ser Val Phe Ala Leu Ile Gly Leu 
260 265 270 

Gln Leu Phe Met Gly Asn Leu Arg Asn Lys Cys Val Val Trp Pro 
275 280 285 

Ile Asn Phe Asn Glu Ser Tyr Leu Glu Asn Gly Thr Arg Gly Phe 
290 295 300 

Asp Trp Glu Glu Tyr Ile Asn Asn Lys Thr Asn Phe Tyr Met Val 
305 310 315 

Pro Gly Met Leu Glu Pro Leu Leu Cys Gly Asn Ser Ser Asp Ala 
320 325 330 

Gly Gln Cys Pro Glu Gly Phe Gln Cys Met Lys Ala Gly Arg Asn 
335 340 345 

Pro Asn Tyr Gly Tyr Thr Ser Phe Asp Thr Phe Ser Trp Ala Phe 
350 355 360 

Leu Ala Leu Phe Arg Leu Met Thr Gln Asp Tyr Trp Glu Asn Leu 
365 370 375 

Tyr Gln Leu Thr Leu Arg Ala Ala Gly Lys Thr Tyr Met Ile Phe 
380 385 390 

Phe Val Leu Val Ile Phe Val Gly Ser Phe Tyr Leu Val Asn Leu 
395 400 405 

Ile Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gln Asn Gln Ala 
410 415 420 

Thr Leu Glu Glu Ala Glu Gln Lys Glu Ala Glu Phe Lys Ala Met 
425 430 435 

Leu Glu Gln Leu Lys Lys Gln Gln Glu Glu Ala Gln Ala Ala Ala 
440 445 450 

Met Ala Thr Ser Ala Gly Thr Val Ser Glu Asp Ala Ile Glu Glu 
455 460 465 

Glu Gly Glu Asp Gly Val Gly Ser Pro Arg Ser Ser Ser Glu Leu 
470 475 480 

Ser Lys Leu Ser Ser Lys Ser Ala Lys Glu Arg Arg Asn Arg Arg 
485 490 495 

Lys Lys Arg Lys Gln Lys Glu Leu Ser Glu Gly Glu Glu Lys Gly 
500 505 510 

Asp Pro Glu Lys Val Phe Lys Ser Glu Ser Glu Asp Gly Met Arg 
515 520 525 

Arg Lys Ala Phe Arg Leu Pro Asp Asn Arg Ile Gly Arg Lys Phe 
530 535 540 

Ser Ile Met Asn Gln Ser Leu Leu Ser Ile Pro Gly Ser Pro Phe 
545 550 555 

Leu Ser Arg His Asn Ser Lys Ser Ser Ile Phe Ser Phe Arg Gly 
560 565 570 

Pro Gly Arg Phe Arg Asp Pro Gly Ser Glu Asn Glu Phe Ala Asp 
575 580 585 

Asp Glu His Ser Thr Val Glu Glu Ser Glu Gly Arg Arg Asp Ser 
590 595 600 

Leu Phe Ile Pro Ile Arg Ala Arg Glu Arg Arg Ser Ser Tyr Ser 
605 610 615 

Gly Tyr Ser Gly Tyr Ser Gln Cys Ser Arg Ser Ser Arg Ile Phe 
620 625 630 

Pro Ser Leu Arg Arg Ser Val Lys Arg Asn Ser Thr Val Asp Cys 
635 640 645 

Asn Gly Val Val Ser Leu Ile Gly Pro Gly Ser His Ile Gly Arg 
650 655 660 

Leu Leu Pro Glu Ala Thr Thr Glu Val Glu Ile Lys Lys Lys Gly 
665 670 675 

Pro Gly Ser Leu Leu Val Ser Met Asp Gln Leu Ala Ser Tyr Gly 
680 685 690 

Arg Lys Asp Arg Ile Asn Ser Ile Met Ser Val Val Thr Asn Thr 
695 700 705 

Leu Val Glu Glu Leu Glu Glu Ser Gln Arg Lys Cys Pro Pro Cys 
710 715 720 

Trp Tyr Lys Phe Ala Asn Thr Phe Leu Ile Trp Glu Cys His Pro 
725 730 735 

Tyr Trp Ile Lys Leu Lys Glu Ile Val Asn Leu Ile Val Met Asp 
740 745 750 

Pro Phe Val Asp Leu Ala Ile Thr Ile Cys Ile Val Leu Asn Thr 
755 760 765 

Leu Phe Met Ala Met Glu His His Pro Met Thr Pro Gln Phe Glu 
770 775 780 

His Val Leu Ala Val Gly Asn Leu Val Phe Thr Gly Ile Phe Thr 
785 790 795 

Ala Glu Met Phe Leu Lys Leu Ile Ala Met Asp Pro Tyr Tyr Tyr 
800 805 810 

Phe Gln Glu Gly Trp Asn Ile Phe Asp Gly Phe Ile Val Ser Leu 
815 820 825 

Ser Leu Met Glu Leu Ser Leu Ala Asp Val Glu Gly Leu Ser Val 
830 835 840 

Leu Arg Ser Phe Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser 
845 850 855 

Trp Pro Thr Leu Asn Met Leu Ile Lys Ile Ile Gly Asn Ser Val 
860 865 870 

Gly Ala Leu Gly Asn Leu Thr Leu Val Leu Ala Ile Ile Val Phe 
875 880 885 

Ile Phe Ala Val Val Gly Met Gln Leu Phe Gly Lys Ser Tyr Lys 
890 895 900 

Glu Cys Val Cys Lys Ile Asn Gln Glu Cys Lys Leu Pro Arg Trp 
905 910 915 

His Met Asn Asp Phe Phe His Ser Phe Leu Ile Val Phe Arg Val 
920 925 930 

Leu Cys Gly Glu Trp Ile Glu Thr Met Trp Asp Cys Met Glu Val 
935 940 945 
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40 



Ala Gly Gin Ala 



Cie Gly Asn Leu 



3 Ser Ser Phe Ser 



Glu Met Asn Asn 



Val Ala Trp Thr 



Phe Lys Gin Arg 



Tyr Glu Lys Lys 



15 He Hi 



20 



s Arg Asn 



Ser Gly He Gly 



His Met Ser Phe 



He Ala Val Gly 



Val Ser Ser Glu 



Asp Thr Ser Ser 



Val Glu Glu Val 



Asp Ala Cys Phe 



Gin Val Asn He 



Arg Lys Thr Cys 



-O Phe He He Phe 



25 



50 



Met 
950 

Val Val 
965 

Ala Asp 
980 

Leu Gin 
995 

Lys Val 
1010 

Glu Ala 
1025 

Ala Asn 
1040 

Gly Asp 
1055 

Ser Ser 
1070 
He 
1085 

Glu Ser 
1100 

Ser Asp 
1115 

Ser Glu 
1130 

Pro Val 
1145 
Thr 
1160 

Glu Glu 
1175 

Phe Leu 
1190 
Met 
1205 



Cys Leu He Val 



Leu Asn Leu 



Asn Leu Ala 



He Ser Val 



Lys Val His 



Asp Glu Val 



Cys He Ala 



Phe Gin Lys 



Val Glu Lys 



Asn Asn Pro Asn 



Asp Phe Glu 



Pro Glu Gly 



Gly Ser Thr 



Glu Gin Pro 



Glu Gly Cys Val 



Gly Leu Gly 



He Val Glu 



He Leu Leu Ser 



Phe 

955 

Phe 

970 

Ala 

985 

He 

1000 

Ala 

1015 

Lys 

1030 

Asn 

1045 

Asn 

1060 

Tyr 

1075 

Leu 

1090 

Asn 

1105 

Ser 

1120 

He 

1135 

Glu 

1150 

Gin 

1165 

Lys 

1180 

His 

1195 

Ser 

1210 



Met Met Val 



Leu Ala Leu 



rhr Asp Asp 



Arg He Lys 



Phe Met Gin 



Pro Leu Asp 



His Thr Gly 



Gly Asn Gly 



He He Asp 



Thr Val Arg 



Leu Asn Thr 



Lys Asp Lys 



Asp He Lys 



Glu Tyr Leu 



Arg Phe Lys 



Ser Trp Trp 



Asn Trp Phe 



Gly Ala Leu 



Met Val 
960 

Leu Leu 
975 

Asp Gly 
990 

Lys Gly 

1005 
Ala His 

1020 
Glu Leu 

1035 
Val Asp 

1050 
Thr Thr 

1065 
Glu Asp 

1080 
Val Pro 

1095 
Glu Asp 

1110 
Leu Asp 

1125 
Pro Glu 

1140 
Asp Pro 

1155 
Cys Cys 

1170 
He Leu 

1185 
Glu Thr 

1200 
Ala Phe 

1215 
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Glu Asp He Tyr He Glu Gin Arg Lys Thr He Arg Thr tie Leu 

1220 1225 1230 

Glu Tyr Ala Asp Lys Val Phe Thr Tyr He Phe He Leu Glu Met 

1235 1240 1245 

5 Leu Leu Lys Trp Thr Ala Tyr Gly Phe Val Lys Phe Phe Thr Asn 

1250 1255 1260 

Ala Trp Cys Trp Leu Asp Phe Leu He Val Ala Val Ser Leu Val 

1265 1270 1275 

Ser Leu He Ala Asn Ala Leu Gly Tyr Ser Glu Leu Gly Ala He 

10 1280 1285 1290 

Lys Ser Leu Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu 

12 95 1300 1305 

3 __ A _g nu £ Giu Qiy M St; a rg va.1 Val Val Asn Ala Leu Val Gly 

1310 1315 1320 

15 Ala He Pro Ser He Met Asn Val Leu Leu Val Cys Leu He Phe 

1325 1330 1335 

Trp Leu He Phe Ser He Met Gly Val Asn Leu Phe Ala Gly Lys 

1340 1345 1350 

Tyr His Tyr Cys Phe Asn Glu Thr Ser Glu He Arg Phe Glu He 

20 X355 1360 1365 

Asp He Val Asn Asn Lys Thr Asp Cys Glu Lys Leu Met Glu Gly 

1370 1375 1380 

Asn Ser Thr Glu He Arg Trp Lys Asn Val Lys He Asn Phe Asp 

1385 1390 1395 

25 Asn Val Gly Ala Gly Tyr Leu Ala Leu Leu Gin Val Ala Thr Phe 

1400 1405 1410 

Lys Gly Trp Met Asp He Met Tyr Ala Ala Val Asp Ser Arg Lys 

14 15 1420 1425 

Pro Asp Glu Gin Pro Asp Tyr Glu Gly Asn He Tyr Met Tyr He 

30 1430 1435 1440 

Tyr Phe Val He Phe He He Phe Gly Ser Phe Phe Thr Leu Asn 

1445 1450 1455 

Leu Fhe He Gly Val He He Asp Asn Phe Asn Gin Gin Lys Lys 

1460 1465 1470 

35 Lys Phe Gly Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys 

1475 1480 1485 

Tyr Tyr Asn Ala Met Lys Lys Leu Gly Ser Lys Lys Pro Gln Lys 
1490 1495 1500 

Pro Ile Pro Arg Pro Leu Asn Lys Ile Gln Gly Ile Val Phe Asp 
1505 1510 1515 

Phe Val Thr Gln Gln Ala Phe Asp Ile Val Ile Met Met Leu Ile 
1520 1525 1530 

Cys Leu Asn Met Val Thr Met Met Val Glu Thr Asp Thr Gln Ser 
1535 1540 1545 

Lys Gln Met Glu Asn Ile Leu Tyr Trp Ile Asn Leu Val Phe Val 
1550 1555 1560 

Ile Phe Phe Thr Cys Glu Cys Val Leu Lys Met Phe Ala Leu Arg 
1565 1570 1575 

His Tyr Tyr Phe Thr Ile Gly Trp Asn Ile Phe Asp Phe Val Val 
1580 1585 1590 

Val Ile Leu Ser Ile Val Gly Met Phe Leu Ala Asp Ile Ile Glu 
1595 1600 1605 

Lys Tyr Phe Val Ser Pro Thr Leu Phe Arg Val Ile Arg Leu Ala 
1610 1615 1620 

Arg Ile Gly Arg Ile Leu Arg Leu Ile Lys Gly Ala Lys Gly Ile 
1625 1630 1635 

Arg Thr Leu Leu Phe Ala Leu Met Met Ser Leu Pro Ala Leu Phe 
1640 1645 1650 

Asn Ile Gly Leu Leu Leu Phe Leu Val Met Phe Ile Phe Ser Ile 
1655 1660 1665 

Phe Gly Met Ser Asn Phe Ala Tyr Val Lys His Glu Ala Gly Ile 
1670 1675 1680 

Asp Asp Met Phe Asn Phe Glu Thr Phe Gly Asn Ser Met Ile Cys 
1685 1690 1695 

Leu Phe Gln Ile Thr Thr Ser Ala Gly Trp Asp Gly Leu Leu Leu 
1700 1705 1710 

Pro Ile Leu Asn Arg Pro Pro Asp Cys Ser Leu Asp Lys Glu His 
1715 1720 1725 

Pro Gly Ser Gly Phe Lys Gly Asp Cys Gly Asn Pro Ser Val Gly 
1730 1735 1740 

Ile Phe Phe Phe Val Ser Tyr Ile Ile Ile Ser Phe Leu Ile Val 
1745 1750 1755 

Val Asn Met Tyr Ile Ala Ile Ile Leu Glu Asn Phe Ser Val Ala 
1760 1765 1770 

Thr Glu Glu Ser Ala Asp Pro Leu Ser Glu Asp Asp Phe Glu Thr 
1775 1780 1785 

Phe Tyr Glu Ile Trp Glu Lys Phe Asp Pro Asp Ala Thr Gln Phe 
1790 1795 1800 

Ile Glu Tyr Cys Lys Leu Ala Asp Phe Ala Asp Ala Leu Glu His 
1805 1810 1815 

Pro Leu Arg Val Pro Lys Pro Asn Thr Ile Glu Leu Ile Ala Met 
1820 1825 1830 

Asp Leu Pro Met Val Ser Gly Asp Arg Ile His Cys Leu Asp Ile 
1835 1840 1845 

Leu Phe Ala Phe Thr Lys Arg Val Leu Gly Asp Ser Gly Glu Leu 
1850 1855 1860 

Asp Ile Leu Arg Gln Gln Met Glu Glu Arg Phe Val Ala Ser Asn 
1865 1870 1875 

Pro Ser Lys Val Ser Tyr Glu Pro Ile Thr Thr Thr Leu Arg Arg 
1880 1885 1890 

Lys Gln Glu Glu Val Ser Ala Val Val Leu Gln Arg Ala Tyr Arg 
1895 1900 1905 

Gly His Leu Ala Arg Arg Gly Phe Ile Cys Arg Lys Met Ala Ser 
1910 1915 1920 

Asn Lys Leu Glu Asn Gly Gly Thr His Arg Asp Lys Lys Glu Ser 
1925 1930 1935 

Thr Pro Ser Thr Ala Ser Leu Pro Ser Tyr Asp Ser Val Thr Lys 
1940 1945 1950 

Pro Asp Lys Glu Lys Gln Gln Arg Ala Glu Glu Gly Arg Arg Glu 
1955 1960 1965 

Arg Ala Lys Arg Gln Lys Glu Val Arg Glu Ser Lys Cys 
1970 1975 
1970 1975 



20 



30 



(5) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1988 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS:
(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:

(A) ORGANISM: rat

(F) TISSUE TYPE: dorsal root ganglia

(G) CELL TYPE: peripheral nerve

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

10 Met Ala Ala Arg Leu Leu Ala Pro Pro Gly Pro Asp Ser Phe Lys 

5 10 15 

Pro Phe Thr Pro Glu Ser Leu Ala Asn lie Glu Arg Arg He Ala 

20 25 30 

Glu Ser Lys Leu Lys Lys Pro Pro Lys Ala Asp Gly Ser His Arg 

15 35 40 45 

Glu Asp Asp Glu Asp Ser Lys Pro Lys Pro Asn Ser Asp Leu Glu 

50 55 60 

Ala Gly Lys Ser Leu Pro Phe He Tyr Gly Asp He Pro Gin Gly 

65 70 75 

20 Leu Val Ala Val Pro Leu Glu Asp Phe Asp Pro Tyr Tyr Leu Thr 

80 85 90 

Gin Lys Thr Phe Val Val Leu Asn Arg Gly Lys Thr Leu Phe Arg 

95 100 105 

Phe Ser Ala Thr Pro Ala Leu Tyr lie Leu Ser Pro Phe Asn Leu 

25 110 115 120 

He Arg Arg He Ala He Lys He Leu He His Ser Val Phe Ser 

125 130 135 

Met He He Met Cys Thr He Leu Thr Asn Cys Val Phe Met Thr 

140 145 150 

30 Phe Ser Asn Pro Pro Glu Trp Ser Lys Asn Val Glu Tyr Thr Phe 

155 160 165 

Thr Gly He Tyr Thr Phe Glu Ser Leu Val Lys lie He Ala Arg 

170 175 180 

Gly Phe Cys He Asp Gly Phe Thr Phe Leu Arg Asp Pro Trp Asn 

185 190 195

Trp Leu Asp Phe Ser Val Ile Met Met Ala Tyr Val Thr Glu Phe

200 205 210 

Val Asp Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu 

215 220 225 

5 Arg Ala Leu Lys Thr lie Ser Val lie Pro Gly Leu Lys Thr lie 

230 235 240 

Val Gly Ala Leu lie Gin Ser Val Lys Lys Leu Ser Asp Val Met 

245 250 255 

lie Leu Thr Val Phe Cys Leu Ser Val Phe Ala Leu lie Gly Leu 

10 260 265 270 

Gin Leu Phe Met Gly Asn Leu Arg Asn Lys Cys Val Val Trp Pro 

275 280 285 

Ile Asn Phe Asn Glu Ser Tyr Leu Glu Asn Gly Thr Arg Gly Phe

290 295 300 

15 Asp Trp Glu Glu Tyr lie Asn Asn Lys Thr Asn Phe Tyr Met Val 

305 310 315 

Pro Gly Met Leu Glu Pro Leu Leu Cys Gly Asn Ser Ser Asp Ala 

320 325 330 

Gly Gin Cys Pro Glu Gly Phe Gin Cys Met Lys Ala Gly Arg Asn 

20 335 340 345 

Pro Asn Tyr Gly Tyr Thr Ser Phe Asp Thr Phe Ser Trp Ala Phe 

350 355 360 

Leu Ala Leu Phe Arg Leu Met Thr Gin Asp Tyr Trp Glu Asn Leu 

365 370 375 

25 Tyr Gin Leu Thr Leu Arg Ala Ala Gly Lys Thr Tyr Met lie Phe 

380 385 390 

Phe Val Leu Val lie Phe Val Gly Ser Phe Tyr Leu Val Asn Leu 

395 400 405 

lie Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gin Asn Gin Ala 

30 410 415 420 

Thr Leu Glu Glu Ala Glu Gin Lys Glu Ala Glu Phe Lys Ala Met 

425 430 435 

Leu Glu Gin Leu Lys Lys Gin Gin Glu Glu Ala Gin Ala Ala Ala 

440 445 450 

35 Met Ala Thr Ser Ala Gly Thr Val Ser Glu Asp Ala lie Glu Glu 

455 460 465

Glu Gly Glu Asp Gly Val Gly Ser Pro Arg Ser Ser Ser Glu Leu

470 475 480 

Ser Lys Leu Ser Ser Lys Ser Ala Lys Glu Arg Arg Asn Arg Arg 

485 490 495 

Lys Lys Arg Lys Gln Lys Glu Leu Ser Glu Gly Glu Glu Lys Gly

500 505 510 

Asp Pro Glu Lys Val Phe Lys Ser Glu Ser Glu Asp Gly Met Arg 

515 520 525 

Arg Lys Ala Phe Arg Leu Pro Asp Asn Arg lie Gly Arg Lys Phe 

10 530 535 540 

Ser He Met Asn Gin Ser Leu Leu Ser He Pro Gly Ser Pro Phe 

545 550 555 

Leu Ser Arg His Asn Ser Lys Ser Ser He Phe Ser Phe Arg Gly 

560 565 570 

15 Pro Gly Arg Phe Arg Asp Pro Gly Ser Glu Asn Glu Phe Ala Asp 

575 580 585 

Asp Glu His Ser Thr Val Glu Glu Ser Glu Gly Arg Arg Asp Ser 

590 595 600 

Leu Phe He Pro He Arg Ala Arg Glu Arg Arg Ser Ser Tyr Ser 

605 610 615

Gly Tyr Ser Gly Tyr Ser Gin Cys Ser Arg Ser Ser Arg He Phe 

620 625 630 

Pro Ser Leu Arg Arg Ser Val Lys Arg Asn Ser Thr Val Asp Cys 

635 640 645 

25 Asn Gly Val Val Ser Leu He Gly Pro Gly Ser His He Gly Arg 

650 655 660 

Leu Leu Pro Glu Val Lys He Asp Lys Ala Ala Thr Asp Ser Ala 

665 670 675 

Thr Thr Glu Val Glu He Lys Lys Lys Gly Pro Gly Ser Leu Leu 

30 680 685 690 
Val Ser Met Asp Gln Leu Ala Ser Tyr Gly Arg Lys Asp Arg Ile

695 700 705

Asn Ser He Met Ser Val Val Thr Asn Thr Leu Val Glu Glu Leu 

710 715 720 

Glu Glu Ser Gln Arg Lys Cys Pro Pro Cys Trp Tyr Lys Phe Ala

725 730 735

Asn Thr Phe Leu Ile Trp Glu Cys His Pro Tyr Trp Ile Lys Leu

740 745 750

Lys Glu Ile Val Asn Leu Ile Val Met Asp Pro Phe Val Asp Leu

755 760 765

Ala Ile Thr Ile Cys Ile Val Leu Asn Thr Leu Phe Met Ala Met

770 775 780

Glu His His Pro Met Thr Pro Gln Phe Glu His Val Leu Ala Val

785 790 795

Gly Asn Leu Val Phe Thr Gly Ile Phe Thr Ala Glu Met Phe Leu

800 805 810

Lys Leu Ile Ala Met Asp Pro Tyr Tyr Tyr Phe Gln Glu Gly Trp

815 820 825

Asn Ile Phe Asp Gly Phe Ile Val Ser Leu Ser Leu Met Glu Leu

830 835 840

Ser Leu Ala Asp Val Glu Gly Leu Ser Val Leu Arg Ser Phe Arg

845 850 855

Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn

860 865 870

Met Leu Ile Lys Ile Ile Gly Asn Ser Val Gly Ala Leu Gly Asn

875 880 885

Leu Thr Leu Val Leu Ala Ile Ile Val Phe Ile Phe Ala Val Val

890 895 900

Gly Met Gln Leu Phe Gly Lys Ser Tyr Lys Glu Cys Val Cys Lys

905 910 915

Ile Asn Gln Glu Cys Lys Leu Pro Arg Trp His Met Asn Asp Phe

920 925 930

Phe His Ser Phe Leu Ile Val Phe Arg Val Leu Cys Gly Glu Trp

935 940 945

Ile Glu Thr Met Trp Asp Cys Met Glu Val Ala Gly Gln Ala Met

950 955 960

Cys Leu He Val Phe Met Met Val Met Val He Gly Asn Leu Val 

965 970 975 

Val Leu Asn Leu Phe Leu Ala Leu Leu Leu Ser Ser Phe Ser Ala 

980 985 990 

35 Asp Asn Leu Ala Ala Thr Asp Asp Asp Gly Glu Met Asn Asn Leu 

995 1000 1005

Gln Ile Ser Val Ile Arg Ile Lys Lys Gly Val Ala Trp Thr Lys

1010 1015 1020

Val Lys Val His Ala Phe Met Gln Ala His Phe Lys Gln Arg Glu

1025 1030 1035

Ala Asp Glu Val Lys Pro Leu Asp Glu Leu Tyr Glu Lys Lys Ala

1040 1045 1050

Asn Cys Ile Ala Asn His Thr Gly Val Asp Ile His Arg Asn Gly

1055 1060 1065

Asp Phe Gln Lys Asn Gly Asn Gly Thr Thr Ser Gly Ile Gly Ser

1070 1075 1080

Ser Val Glu Lys Tyr lie He Asp Glu Asp His Met Ser Phe He 

1085 1090 1095 

Asn Asn Pro Asn Leu Thr Val Arg Val Pro He Ala Val Gly Glu 

1100 1105 1110 

Ser Asp Phe Glu Asn Leu Asn Thr Glu Asp Val Ser Ser Glu Ser

1115 1120 1125

Asp Pro Glu Gly Ser Lys Asp Lys Leu Asp Asp Thr Ser Ser Ser

1130 1135 1140

Glu Gly Ser Thr Ile Asp Ile Lys Pro Glu Val Glu Glu Val Pro

1145 1150 1155

Val Glu Gln Pro Glu Glu Tyr Leu Asp Pro Asp Ala Cys Phe Thr

1160 1165 1170

Glu Gly Cys Val Gln Arg Phe Lys Cys Cys Gln Val Asn Ile Glu

1175 1180 1185

Glu Gly Leu Gly Lys Ser Trp Trp Ile Leu Arg Lys Thr Cys Phe

1190 1195 1200

Leu Ile Val Glu His Asn Trp Phe Glu Thr Phe Ile Ile Phe Met

1205 1210 1215

Ile Leu Leu Ser Ser Gly Ala Leu Ala Phe Glu Asp Ile Tyr Ile

1220 1225 1230

Glu Gin Arg Lys Thr He Arg Thr He Leu Glu Tyr Ala Asp Lys 

1235 1240 1245 

Val Phe Thr Tyr He Phe He Leu Glu Met Leu Leu Lys Trp Thr 

1250 1255 1260 

35 Ala Tyr Gly Phe Val Lys Phe Phe Thr Asn Ala Trp Cys Trp Leu 

1265 1270 1275

Asp Phe Leu Ile Val Ala Val Ser Leu Val Ser Leu Ile Ala Asn

1280 1285 1290 

Ala Leu Gly Tyr Ser Glu Leu Gly Ala lie Lys Ser Leu Arg Thr 

1295 1300 1305 

5 Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Arg Phe Glu Gly 

1310 1315 1320 

Met Arg Val Val Val Asn Ala Leu Val Gly Ala lie Pro Ser lie 

1325 1330 1335 

Met Asn Val Leu Leu Val Cys Leu lie Phe Trp Leu lie Phe Ser 

10 1340 1345 1350 

lie Met Gly Val Asn Leu Phe Ala Gly Lys Tyr His Tyr Cys Phe 

1355 1360 1365 

Asn Glu Thr Ser Glu lie Arg Phe Glu lie Asp lie Val Asn Asn 

1370 1375 1380 

15 Lys Thr Asp Cys Glu Lys Leu Met Glu Gly Asn Ser Thr Glu lie 

1385 1390 1395 

Arg Trp Lys Asn Val Lys lie Asn Phe Asp Asn Val Gly Ala Gly 

1400 1405 1410 

Tyr Leu Ala Leu Leu Gin Val Ala Thr Phe Lys Gly Trp Met Asp 

20 1415 1420 1425 

lie Met Tyr Ala Ala Val Asp Ser Arg Lys Pro Asp Glu Gin Pro 

1430 1435 1440 

Asp Tyr Glu Gly Asn lie Tyr Met Tyr lie Tyr Phe Val lie Phe 

1445 1450 1455 

25 lie lie Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe lie Gly Val 

1460 1465 1470 

lie lie Asp Asn Phe Asn Gin Gin Lys Lys Lys Phe Gly Gly Gin 

1475 1480 1485 

Asp lie Phe Met Thr Glu Glu Gin Lys Lys Tyr Tyr Asn Ala Met 

30 1490 1495 1500 

Lys Lys Leu Gly Ser Lys Lys Pro Gin Lys Pro lie Pro Arg Pro 

1505 1510 1515 

Leu Asn Lys lie Gin Gly lie Val Phe Asp Phe Val Thr Gin Gin 

1520 1525 1530 

35 Ala Phe Asp lie Val lie Met Met Leu lie Cys Leu Asn Met Val 

1535 1540 1545

Thr Met Met Val Glu Thr Asp Thr Gln Ser Lys Gln Met Glu Asn

1550 1555 1560

Ile Leu Tyr Trp Ile Asn Leu Val Phe Val Ile Phe Phe Thr Cys

1565 1570 1575

Glu Cys Val Leu Lys Met Phe Ala Leu Arg His Tyr Tyr Phe Thr

1580 1585 1590

Ile Gly Trp Asn Ile Phe Asp Phe Val Val Val Ile Leu Ser Ile

1595 1600 1605

Val Gly Met Phe Leu Ala Asp Ile Ile Glu Lys Tyr Phe Val Ser

1610 1615 1620

Pro Thr Leu Phe Arg Val Ile Arg Leu Ala Arg Ile Gly Arg Ile

1625 1630 1635

Leu Arg Leu He Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe 

1640 1645 1650 

15 Ala Leu Met Met Ser Leu Pro Ala Leu Phe Asn He Gly Leu Leu 

1655 1660 1665 

Leu Phe Leu Val Met Phe He Phe Ser He Phe Gly Met Ser Asn 

1670 1675 1680

Phe Ala Tyr Val Lys His Glu Ala Gly He Asp Asp Met Phe Asn 

20 1685 1690 1695 

Phe Glu Thr Phe Gly Asn Ser Met He Cys Leu Phe Gin He Thr 

1700 1705 1710 

Thr Ser Ala Gly Trp Asp Gly Leu Leu Leu Pro He Leu Asn Arg 

1715 1720 1725 

25 Pro Pro Asp Cys Ser Leu Asp Lys Glu His Pro Gly Ser Gly Phe 

1730 1735 1740 

Lys Gly Asp Cys Gly Asn Pro Ser Val Gly He Phe Phe Phe Val 

1745 1750 1755 

Ser Tyr He He He Ser Phe Leu He Val Val Asn Met Tyr He 

30 1760 1765 1770 

Ala He He Leu Glu Asn Phe Ser Val Ala Thr Glu Glu Ser Ala 

1775 1780 1785 

Asp Pro Leu Ser Glu Asp Asp Phe Glu Thr Phe Tyr Glu He Trp 

1790 1795 1800 

35 Glu Lys Phe Asp Pro Asp Ala Thr Gin Phe He Glu Tyr Cys Lys 

1805 1810 1815

Leu Ala Asp Phe Ala Asp Ala Leu Glu His Pro Leu Arg Val Pro

1820 1825 1830

Lys Pro Asn Thr Ile Glu Leu Ile Ala Met Asp Leu Pro Met Val

1835 1840 1845

Ser Gly Asp Arg Ile His Cys Leu Asp Ile Leu Phe Ala Phe Thr

1850 1855 1860

Lys Arg Val Leu Gly Asp Ser Gly Glu Leu Asp He Leu Arg Gin 

1865 1870 1875 

Gln Met Glu Glu Arg Phe Val Ala Ser Asn Pro Ser Lys Val Ser

1880 1885 1890

Tyr Glu Pro Ile Thr Thr Thr Leu Arg Arg Lys Gln Glu Glu Val

1895 1900 1905

Ser Ala Val Val Leu Gln Arg Ala Tyr Arg Gly His Leu Ala Arg

1910 1915 1920

Arg Gly Phe Ile Cys Arg Lys Met Ala Ser Asn Lys Leu Glu Asn

1925 1930 1935

Gly Gly Thr His Arg Asp Lys Lys Glu Ser Thr Pro Ser Thr Ala

1940 1945 1950

Ser Leu Pro Ser Tyr Asp Ser Val Thr Lys Pro Asp Lys Glu Lys

1955 1960 1965

Gln Gln Arg Ala Glu Glu Gly Arg Arg Glu Arg Ala Lys Arg Gln

1970 1975 1980

Lys Glu Val Arg Glu Ser Lys Cys

1985

(6) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 696 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: RT-PCR
(A) DESCRIPTION: /desc = "DNA probe"

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE:
(A) ORGANISM: rat 

(F) TISSUE TYPE: dorsal root ganglia 

(G) CELL TYPE: peripheral nerve 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

1 CTCAACATGG TTACTATGAT GGTGGAGACA GACACTCAGA GCAAGCAGAT 

51 GGAGAACATT CTTTACTGGA TTAATCTGGT CTTTGTCATC TTCTTCACCT 

101 GCGAGTGTGT GCTCAAAATG TTTGCCTTGA GACACTACTA TTTCACCATT 

151 GGCTGGAACA TCTTTGACTT TGTGGTGGTC ATCCTCTCCA TTGTGGGAAT 

201 GTTCCTGGCT GATATCATTG AGAAGTACTT CGTCTCCCCA ACCCTATTCC

251 GAGTTATCCG ATTGGCCCGT ATTGGGCGCA TCTTGCGTCT GATCAAGGGG

301 GCCAAAGGGA TCCGCACCCT GCTCTTTGGC CTTAATGATG TCGCTGGCCG 

351 CCCTGTTCAA CATCGCCTCC TGCTCTTCCT CGTCATGTTC ATCTTCTCCA 

401 TTTTTGGCAT GTCCAACTTC GCATACGTGA AGCACGAGGC CGGCATTGAC 

451 GACATGTTCA ACTTCGAGAC ATTTGGCAAC AGCATGATCT GTTTGTTCCA 

501 GATCACAACG TCTGCTGGCT GGGATGGCCT GCTGCTGCCA ATCCTGAACC 

551 GCCCCCCTGA CTGCAGCTTG GACAAAGAGC ACCCAGGGAG TGGCTTCAAA 

601 GGGGACTGTG GGAACCCCTC GGTGGGCATC TTCTTCTTTG TGAGCTACAT 

651 CATCATCTCC TTCCTGATTG TGGTGAACAT GTACATCGCA GTCATC 
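The relationship between this 696 base pair probe and the 232 residue peptide of SEQ ID NO:6 (696 / 3 = 232) can be illustrated with a short translation sketch. The snippet below is only an illustration under stated assumptions: the function name `translate` and the compact construction of the standard genetic code table are choices made for this example and are not part of the original disclosure. It translates the first 30 bases of SEQ ID NO:5 in the first reading frame.

```python
# Minimal sketch: translate the first ten codons of SEQ ID NO:5.
# The helper names are hypothetical; the table is the standard genetic code.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon (TTT, TTC, ..., GGG) to its one-letter amino acid code.
CODON_TABLE = {
    "".join(codon): AMINO[i] for i, codon in enumerate(product(BASES, repeat=3))
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of ten bases from the listing above.
probe_start = "CTCAACATGG TTACTATGAT GGTGGAGACA"
peptide_start = translate(probe_start)
print(peptide_start)
```

In one-letter code the result corresponds to Leu Asn Met Val Thr Met Met Val Glu Thr, the first ten residues listed for SEQ ID NO:6.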



(7) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 232 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:

(A) ORGANISM: rat

(F) TISSUE TYPE: dorsal root ganglia

(G) CELL TYPE: peripheral nerve

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Leu Asn Met Val Thr Met Met Val Glu Thr Asp Thr Gln Ser Lys

5 10 15

Gln Met Glu Asn Ile Leu Tyr Trp Ile Asn Leu Val Phe Val Ile

20 25 30

Phe Phe Thr Cys Glu Cys Val Leu Lys Met Phe Ala Leu Arg His

35 40 45

Tyr Tyr Phe Thr Ile Gly Trp Asn Ile Phe Asp Phe Val Val Val

50 55 60

Ile Leu Ser Ile Val Gly Met Phe Leu Ala Asp Ile Ile Glu Lys

65 70 75

Tyr Phe Val Ser Pro Thr Leu Phe Arg Val Ile Arg Leu Ala Arg

80 85 90

Ile Gly Arg Ile Leu Arg Leu Ile Lys Gly Ala Lys Gly Ile Arg

95 100 105

Thr Leu Leu Phe Gly Leu Asn Asp Val Ala Gly Arg Pro Val Gln

110 115 120

His Arg Leu Leu Leu Phe Leu Val Met Phe Ile Phe Ser Ile Phe

125 130 135

Gly Met Ser Asn Phe Ala Tyr Val Lys His Glu Ala Gly Ile Asp

140 145 150

Asp Met Phe Asn Phe Glu Thr Phe Gly Asn Ser Met Ile Cys Leu

155 160 165

Phe Gln Ile Thr Thr Ser Ala Gly Trp Asp Gly Leu Leu Leu Pro

170 175 180

Ile Leu Asn Arg Pro Pro Asp Cys Ser Leu Asp Lys Glu His Pro

185 190 195

Gly Ser Gly Phe Lys Gly Asp Cys Gly Asn Pro Ser Val Gly Ile

200 205 210

Phe Phe Phe Val Ser Tyr Ile Ile Ile Ser Phe Leu Ile Val Val

215 220 225

Asn Met Tyr Ile Ala Val Ile

230



(8) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6556 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: rat

(F) TISSUE TYPE: Dorsal root ganglia

(G) CELL TYPE: Peripheral nerve

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:



CCAAGATGGC GCCCACCGCA GTCCCGCCCG CCGCAGCCTC GGCGCCTCTG 50

CAGTCCGGCC GCGCCTCCCG GGCCCCGCGC TAGGGCCGCT GCCGCCTCGC 100

CCGCCGCCGC CGCCGCCAGC TGACCTGTCC CGGACACATA ACTAACGAAG 150

CTGCTGCAGG ATGAGAAGAT GGCAGCGCGG CTGCTCGCAC CACCAGGCCC 200

TGATAGTTTC AAGCCTTTCA CCCCTGAGTC GCTGGCAAAC ATCGAGAGGC 250

GTATTGCCGA GAGCAAGCTC AAGAAACCAC CAAAGGCGGA TGGCAGCCAC 300

CGGGAGGACG ATGAAGACAG CAAGCCCAAG CCAAACAGTG ACCTGGAGGC 350

TGGGAAGAGT TTGCCTTTCA TCTACGGGGA CATCCCGCAA GGCCTGGTTG 400

CGGTTCCCCT GGAGGACTTT GACCCTTACT ATTTGACGCA GAAAACCTTT 450

GTAGTATTAA ACAGAGGGAA AACTCTCTTC AGATTTAGTG CCACACCTGC 500

CTTGTACATT TTAAGCCCTT TTAACCTGAT AAGAAGAATA GCTATTAAAA 550

TTTTGATACA CTCAGTTTTC AGCATGATCA TCATGTGCAC CATCCTGACC 600

AACTGTGTGT TCATGACCTT TAGTAACCCT CCAGAATGGT CCAAGAATGT 6 50 

GGAGTACACA TTCACAGGGA TTTACACATT TGAATCACTA GTGAAAATCA 7 00 

TCGCAAGAGG TTTCTGCATA GACGGCTTCA CCTTCTTGCG AGACCCGTGG 750

AACTGGTTAG ACTTCAGTGT CATCATGATG GCATATGTGA CAGAGTTTGT 800 

GGACCTGGGC AATGTCTCAG CGCTGAGAAC ATTCAGGGTT CTCCGAGCTT 8 50 

TGAAAACTAT CTCTGTAATT CCAGGCCTGA AGACAATCGT GGGCGCCCTA 900 

ATCCAGTCCG TGAAGAAGCT GTCGGACGTG ATGATCCTGA CAGTGTTCTG 950 

10 CCTGAGTGTT TTCGCCCTGA TTGGCCTGCA GCTCTTCATG GGGAACCTT - 9 99 

CGAAACAAGT GTGTCGTGTG GCCCATAAAC TTCAACGAGA GCTACCTGGA 1049 

GAACGGCACC AGAGGCTTTG ACTGGGAGGA ATATATCAAC AATAAAACAA 1099 

ACTTTTACAT GGTTCCTGGC ATGCTAGAAC CCTTGCTCTG CGGGAACAGT 1149 

TCTGATGCTG GGCAATGCCC AGAGGGATTC CAGTGCATGA AAGCAGGAAG 1199 

15 GAACCCCAAC TACGGTTACA CCAGCTTTGA CACCTTCAGC TGGGCCTTCT 12 49 

TGGCATTATT CCGCCTTATG ACCCAGGACT ATTGGGAGAA CTTATACCAG 1299 

CTGACCTTAC GAGCCGCTGG GAAAACGTAC ATGATCTTCT TTGTCTTGGT 1349 

CATCTTCGTG GGTTCTTTCT ATCTGGTGAA CTTGATCTTG GCTGTGGTGG 1399 

CCATGGCTTA TGAGGAACAG AACCAGGCAA CACTGGAGGA GGCAGAGCAA 1449 

20 AAAGAGGCCG AGTTCAAGGC AATGCTGGAG CAACTCAAGA AGCAGCAGGA 1499 

GGAGGCACAG GCTGCTGCAA TGGCCACCTC AGCGGGCACT GTCTCGGAAG 1549 

ACGCCATTGA AGAAGAAGGG GAAGATGGGG TAGGCTCTCC GAGGAGCTCT 1599 

TCTGAACTGT CTAAACTCAG TTCCAAGAGC GCGAAGGAGC GGCGGAACCG 1649 

ACGGAAGAAG AGGAAGCAGA AGGAGCTCTC TGAAGGCGAG GAGAAAGGGG 1699 

25 ACCCGGAGAA GGTGTTTAAG TCAGAGTCGG AAGACGGTAT GAGAAGGAAG 1749 

GCCTTCCGGC TGCCAGACAA CAGGATAGGG AGGAAGTTTT CCATCATGAA 1799 

TCAGTCGCTG CTCAGCATTC CAGGCTCGCC CTTCCTCTCC CGACATAACA 1849 

GCAAAAGCAG CATCTTCAGC TTCCGGGGAC CCGGTCGGTT CCGGGACCCC 1899 

GGCTCTGAGA ATGAGTTCGC AGACGATGAA CACAGCACCG TGGAGGAGAG 1949 

30 CGAGGGCCCG CGTGACTCGC TCTTCATCCC GATCCGCGCC CGCGAGCGCC 19 99 

GCAGCAGCTA CAGTGGCTAC AGCGGCTACA GCCAGTGCAG CCGCTCGTCG 2 049 

CGCATCTTCC CCAGCCTGCG GCGCAGCGTG AAGCGCAACA GCACGGTGGA 2099 

CTGCAACGGC GTAGTGTCAC TCATCGGGCC CGGCTCACAC ATCGGGCGGC 2149 

TCCTGCCTGA GGCAACGACT GAGGTGGAAA TTAAGAAGAA AGGCCCTGGA 2199 

35 TCTCTTTTAG TTTCTATGGA CCAACTCGCC TCCTACGGAC GGAAGGACAG 22 49 

AATCAACAGC ATAATGAGCG TGGTCACAAA CACGCTAGTG GAAGAGCTGG 2299

AAGAGTCTCA GAGAAAGTGC CCACCGTGCT GGTATAAGTT TGCCAACACT 2349

TTCCTCATCT GGGAGTGTCA CCCCTACTGG ATAAAACTGA AGGAGATCGT 2399

GAACTTAATC GTCATGGACC CTTTTGTAGA CTTAGCCATC ACCATCTGCA 2 44 9 

TCGTTCTGAA TACGCTATTT ATGGCAATGG AGCACCATCC CATGACACCA 2 49 9 

CAGTTCGAAC ACGTCTTGGC CGTAGGAAAT CTGGTGTTCA CCGGGATCTT 2 549 

CACGGCGGAA ATGTTTCTGA AGCTCATAGC CATGGACCCC TACTATTATT 2 599 

TCCAAGAAGG CTGGAACATT TTTGACGGAT TTATTGTCTC CCTCAGTTTA 2 649 

ATGGAGCTGA GTCTCGCAGA TGTGGAGGGG CTCTCAGTGC TGCGGTCTTT 2 699 

CCGACTGCTC CGAGTCTTCA AGCTGGCCAA GTCCTGGCCC ACCCTGAACA 2 749 

TGCTGATCAA GATCATCGGG AACTCCGTGG GTGCCCTGGG CAACCTGACC 2 79 9 

CTGGTGCTGG CCATCATCGT CTTCATCTTC GCCGTGGTGG GGATGCAGCT 2 849 

GTTTGGAAAG AGTTACAAGG AGTGCGTCTG TAAGATCAAC CAGGAGTGCA 2 899 

AGCTCCCGCG CTGGCACATG AACGACTTCT TCCACTCCTT CCTCATCGTC 2 949 

TTCCGAGTGC TGTGTGGGGA GTGGATCGAG ACCATGTGGG ACTGCATGGA 2 99 9 

GGTGGCCGGC CAGGCCATGT GCCTCATTGT CTTCATGATG GTTATGGTCA 3 049 

TTGGCAACCT GGTGGTGCTG AATCTATTCC TGGCCTTGCT TCTGAGCTCC 3099

TTCAGCGCAG ACAACCTGGC GGCCACAGAC GACGACGGGG AAATGAACAA 3149 

CCTGCAGATC TCAGTGATCC GGATCAAGAA GGGCGTGGCC TGGACCAAAG 3199 

TGAAGGTGCA CGCCTTCATG CAGGCTCACT TCAAGCAGCG GGAGGCGGAT 3249 

GAAGTGAAAC CCCTCGACGA GCTGTATGAG AAGAAGGCCA ACTGCATCGC 3299 

CAACCACACG GGCGTGGATA TCCACCGGAA CGGCGACTTC CAGAAGAACG 3 349 

GGAACGGAAC CACCAGCGGC ATCGGCAGCA GCGTGGAGAA GTACATCATC 3 399 

GACGAGGACC ACATGTCCTT CATTAACAAC CCAAACCTGA CCGTCCGGGT 3 449 

GCCCATTGCT GTGGGCGAGT CTGACTTCGA GAACCTCAAC ACAGAGGATG 3 499 

TTAGCAGCGA ATCAGACCCT GAAGGCAGCA AAGATAAACT GGACGATACC 3 549 

AGCTCCTCAG AAGGAAGTAC CATCGACATC AAGCCTGAGG TGGAAGAAGT 3 599 

TCCCGTGGAG CAACCTGAGG AATACTTGGA TCCGGACGCC TGCTTTACAG 3 649 

AGGGTTGCGT CCAGCGGTTC AAGTGCTGCC AGGTCAACAT CGAGGAAGGA 3 699 

CTAGGCAAGT CGTGGTGGAT CTTGCGGAAA ACCTGCTTCC TCATTGTGGA 3749 

GCACAATTGG TTTGAGACCT TCATCATCTT CATGATTCTG CTCAGCAGTG 3799 

GCGCCCTGGC CTTTGAGGAC ATCTACATTG AGCAGAGGAA GACCATCCGC 3 849 

ACCATCCTGG AGTATGCGGA CAAGGTCTTC ACCTACATCT TCATCCTGGA 3 899 

GATGTTGCTC AAGTGGACAG CCTACGGCTT CGTCAAGTTC TTCACCAATG 3 949 

CCTGGTGCTG GTTGGACTTC CTCATTGTGG CTGTCTCTTT AGTCAGCCTT 3 999 

ATAGCTAATG CCCTGGGCTA CTCGGAACTA GGTGCCATAA AGTCCCTTAG 4049 

GACCCTAAGA GCTTTGAGAC CCTTAAGAGC CTTATCACGA TTTGAAGGGA 4099

TGAGGGTGGT GGTGAATGCC TTGGTGGGCG CCATCCCCTC CATCATGAAT 4149

GTGCTGCTGG TGTGTCTCAT CTTCTGGCTG ATTTTCAGCA TCATGGGAGT 4199 

TAACCTGTTT GCGGGGAAAT ACCACTACTG CTTTAATGAG ACTTCTGAAA 4249 

TCCGGTTCGA AATCGATATT GTCAACAATA AAACGGACTG TGAGAAGCTC 4299 

5 ATGGAGGGCA ACAGCACGGA GATCCGATGG AAGAATGTCA AGATCAACTT 4 349 

TGACAATGTC GGAGCAGGGT ACCTGGCCCT TCTTCAAGTG GCAACCTTCA 43 99 

AAGGCTGGAT GGACATCATG TATGCGGCTG TAGATTCCCG AAAGCCAGAC 4449

GAGCAGCCTG ACTACGAGGG CAACATCTAC ATGTACATCT ACTTCGTCAT 44 99 

CTTCATCATC TTCGGCTCCT TCTTCACCCT CAACCTGTTC ATCGGTGTCA 4 549 

10 TCATCGACAA CTTCAACCAG CAGAAGAAAA AGTTTGGAGG TCAGGACATC 459 9 

TTCATGACAG AGGAACAGAA GAAGTACTAC AATGCCATGA AAAAGCTGGG 4649 

CTCCAAGAAG CCACAGAAGC CCATCCCCCG ACCCTTGAAC AAAATCCAAG 469 9 

GGATTGTCTT TGATTTCGTC ACTCAACAAG CCTTTGACAT TGTGATCATG 4749 

ATGCTCATCT GCCTTAACAT GGTGACAATG ATGGTGGAGA CAGACACTCA 4799 

15 GAGCAAGCAG ATGGAGAACA TTCTTTACTG GATTAATCTG GTCTTTGTCA 4 849 

TCTTCTTCAC CTGCGAGTGT GTGCTCAAAA TGTTTGCCTT GAGACACTAC 4899 

TATTTCACCA TTGGCTGGAA CATCTTTGAC TTTGTGGTGG TCATCCTCTC 4949 

CATTGTGGGA ATGTTCCTGG CTGATATCAT TGAGAAGTAC TTCGTCTCCC 4999 

CAACCCTATT CCGAGTTATC CGATTGGCCC GTATTGGGCG CATCTTGCGT 5049 

20 CTGATCAAGG GCGCCAAAGG GATCCGCACC CTGCTCTTTG CCTTAATGAT 5099 

GTCGCTGCCC GCCCTGTTCA ACATCGGCCT CCTGCTCTTC CTCGTCATGT 5149 

TCATCTTCTC CATTTTTGGC ATGTCCAACT TCGCATACGT GAAGCACGAG 5199 

GCCGGCATTG ACGACATGTT CAACTTCGAG ACATTTGGCA ACAGCATGAT 52 49 

CTGTTTGTTC CAGATCACAA CGTCTGCTGG CTGGGATGGC CTGCTGCTGC 5299 

CAATCCTGAA CCGCCCCCCT GACTGCAGCT TGGACAAAGA GCACCCAGGG 5349

AGTGGCTTCA AAGGGGACTG TGGGAACCCC TCGGTGGGCA TCTTCTTCTT 53 99 

TGTGAGCTAC ATCATCATCT CCTTCCTGAT TGTGGTGAAC ATGTACATCG 5449 

CCATCATCCT GGAGAACTTC AGCGTGGCCA CCGAGGAGAG CGCCGACCCT 5499 

CTGAGTGAGG ATGACTTCGA GACTTTCTAT GAGATCTGGG AGAAGTTTGA 5549 

30 CCCAGACGCC ACCCAGTTCA TCGAGTACTG TAAGCTGGCA GACTTTGCCG 5599 

ACGCCCTGGA GCACCCGCTC CGAGTACCCA AGCCCAACAC CATCGAGCTC 5649 

ATCGCCATGG ACCTGCCCAT GGTGAGCGGA GATCGCATCC ACTGCTTGGA 5699 

CATCCTTTTC GCCTTCACCA AGCGAGTCCT GGGAGACAGT GGGGAGTTGG 57 49 

ACATCCTGCG GCAGCAGATG GAGGAGCGGT TCGTGGCATC CAATCCTTCC 5799 

35 AAAGTGTCTT ACGA-GCCTA TCACAACCAC TCTGCGGCGC AAGCAGGAGG 5848 

AGGTGTCTGC AGTGGTCCTG CAGCGTGCCT ACAGGGGACA CTTGGCTAGG 5898

CGGGGCTTCA TCTGCAGAAA GATGGCCTCC AACAAGCTGG AGAATGGAGG 5948

CACACACAGA GACAAGAAGG AGAGCACCCC GTCCACAGCC TCCCTCCCCT 5998

CTTACGACAG CGTCACAAAG CCAGACAAGG AGAAGCAGCA GCGTGCGGAG 6048

[illegible] GGGAAAGAGC CAAGAGGCAA AAAGAGGTCA GGGAGTCCAA 6098

[illegible] AGLjGGAAAGG AAGCTTACCC CGGCTGAACA CTGGCAAGTG 6148

AAAL^L. I TGTT TACAAACTTC CGAATCTCAC GGATGCAGAG CAGCTGTGCA 6198

ljal,gc_ rcGCT GTACTGGAAG ACCTATACCA AACATAGTCT GCTTACATGT 6248

LjAL,ATGGTGG CATCCTGAGC GGTGA CT GCTGGGGACA AAGGACCCTG 6295

CTCCCTGGAC TCACAGATCT CCTATCGCTT GGGCAGACGG TTACTGCATG 6345

TTCCACACTT AGTCAATGCA ACTTAGGACT AAACTAACCA GGATACAAAA 6395

CCGAGGCGGC TGCCGGGACC AGCAGATCAC CGCTGCAGCC AAATGGATTT 6445

TATTTTTTCA TTTTGTTGAT TCTCAGAAGC AGAAAGCATC ACTTTAAAAG 6495

TTTGTTTGTT CATNCAAACA ATATTTGAAT TCTTACATTA GTTAAGCTAA 6545

GCANCAAAAA 6556














WO 98/38302 PCT/EP98/00997 


Claims 

1. An isolated DNA sequence comprising the nucleotide sequence set forth in
SEQ ID NO:7.

2. The isolated DNA sequence of claim 1 comprising the nucleotide sequence
as set forth in SEQ ID NO:1.

3. The DNA of Claim 2 wherein said DNA sequence is encoding a sodium
channel protein.

4. The DNA of Claim 3 wherein said sodium channel protein is the α-subunit.

5. The DNA of Claim 4 wherein said sodium channel protein is
tetrodotoxin-sensitive.

6. The DNA of Claim 5 wherein said sodium channel protein is found in
mammals.

7. The DNA of Claim 5 wherein said sodium channel protein is found in rat.

8. The DNA of Claim 5 wherein said sodium channel protein is found in
human.

9. The DNA of Claim 2 wherein said DNA is cDNA.

10. The DNA of Claim 2 wherein said DNA is synthetic DNA.

11. Expression vectors comprising a DNA as claimed in claims 2 to 10.

12. Host cells transformed with an expression vector of Claim 11.

13. A cDNA library comprising a host cell of Claim 12.

14. A recombinant polynucleotide comprising a nucleic acid sequence derived
from a DNA sequence as claimed in claims 2 to 10.

15. A tetrodotoxin-sensitive sodium channel protein encoded by a DNA of
claims 2 to 10 or allelic variants thereof.

16. The protein having the amino acid sequence set forth in SEQ ID NO:3.

17. An assay for inhibitors of tetrodotoxin-sensitive sodium channel protein
encoded by a DNA of claims 2 to 10 or the amino acid sequence as set forth in SEQ ID
NO:3 comprising contacting a compound suspected of being said inhibitor with expressed
sodium channel protein and measuring the activity of said expressed sodium channel
protein.

18. An isolated DNA sequence comprising the nucleotide sequence set forth in
SEQ ID NO:7 having a 30 base pair insert after base pair number 2050, the 30 base pair
insert comprising gtgaaaatagataaggcagctacggacagc.
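As an illustration of the operation recited in claim 18, the following sketch places the 30 base pair insert after a given 1-based position of a sequence string. It is a toy example under stated assumptions: the helper name `insert_after` is hypothetical, and the template is a placeholder string rather than the actual SEQ ID NO:7 sequence.

```python
# Minimal sketch: insert a fragment after a 1-based position of a sequence.
# `insert_after` and `template` are hypothetical names used for illustration.

INSERT_30BP = "gtgaaaatagataaggcagctacggacagc"  # insert recited in claim 18


def insert_after(seq: str, position: int, insert: str) -> str:
    """Return `seq` with `insert` placed after the 1-based base `position`."""
    if not 0 <= position <= len(seq):
        raise ValueError("position lies outside the sequence")
    return seq[:position] + insert + seq[position:]


# Placeholder template of 2060 bases standing in for a real cDNA sequence.
template = "a" * 2060
variant = insert_after(template, 2050, INSERT_30BP)

print(len(INSERT_30BP))               # length of the insert
print(len(variant) - len(template))   # bases added by the insertion
```

Because the insert is a multiple of three bases long, an insertion at a codon boundary adds ten codons without shifting the downstream reading frame.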

19. The isolated DNA sequence of claim 18 comprising the nucleotide sequence
as set forth in SEQ ID NO:2.

20. The DNA of Claim 19 wherein said DNA sequence is encoding a sodium
channel protein.

21. The DNA of Claim 20 wherein said sodium channel protein is the
α-subunit.

22. The DNA of Claim 21 wherein said sodium channel protein is
tetrodotoxin-sensitive.

23. The DNA of Claim 22 wherein said sodium channel protein is found in
mammal.

24. The DNA of Claim 22 wherein said sodium channel protein is found in rat.

25. The DNA of Claim 22 wherein said sodium channel protein is found in
human.

26. The DNA of Claim 19 wherein said DNA is cDNA.



27. The DNA of Claim 19 wherein said DNA is synthetic DNA. 

28. Expression vectors comprising a DNA as claimed in claims 19 to 27. 

29. Host cells transformed with an expression vector of Claim 28. 



30. A cDNA library comprising a host cell of Claim 29. 

31. A recombinant polynucleotide comprising a nucleic acid sequence derived
from a DNA sequence as claimed in claims 19 to 27.

32. A tetrodotoxin-sensitive sodium channel protein encoded by a DNA of
Claims 19 to 27 or allelic variants thereof.

33. The protein having the amino acid sequence set forth in SEQ ID NO:4.

34. An assay for inhibitors of tetrodotoxin-sensitive sodium channel protein
encoded by a DNA of claims 19 to 27 or the amino acid sequence as set forth in SEQ ID
NO:4 comprising contacting a compound suspected of being said inhibitor with expressed
sodium channel protein and measuring the activity of said expressed sodium channel
protein.

35. An isolated polynucleotide probe comprising the nucleotide sequence set
forth in SEQ ID NO:5 and complements thereof.
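The "complements thereof" wording of claim 35 can be illustrated by computing the reverse complement of a stretch of the probe. The sketch below is an assumption-laden illustration only: the function name `reverse_complement` is hypothetical, and only the first 20 bases of SEQ ID NO:5 are used.

```python
# Minimal sketch: reverse complement of a DNA string.
# `reverse_complement` is a hypothetical helper, not part of the disclosure.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse, giving the opposite strand 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


# First 20 bases of SEQ ID NO:5 as listed in the sequence description.
probe_start = "CTCAACATGGTTACTATGAT"
antisense = reverse_complement(probe_start)
print(antisense)
```

Applying the function twice returns the original strand, which is a quick sanity check when preparing sense and antisense probes from the same listing.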

36. The protein having the amino acid sequence set forth in SEQ ID NO:6.

37. A polynucleotide probe comprising the polynucleotide of Claim 35 bound
to a reporter molecule.

38. A method of growing plasmids containing constructs of tetrodotoxin-sensitive
sodium channel proteins comprising:

employing competent E. coli cell lines for primary transformations which allow the
stable propagation of plasmids containing unstable inserts following ligation reactions;

employing 1/2 strength freezing media mixed with a broth chosen from the group
comprising full strength YENB, full strength YET, and full strength LB, the broth being
mixed with agar for solid media;

employing full strength freezing media plus 1/2 strength LB for liquid media;

employing carbenicillin as an antibiotic;

providing a temperature no greater than 30° C; and

growing the plasmids for periods longer than 20 hours.


39. The method according to claim 38 employing competent E. coli cells for 
secondary transformants. 

40. The method according to claim 38 wherein the temperature is 28° C. 

41. The method according to claim 38 wherein the growing period is in the range
of approximately 36 to 48 hours.

42. The method according to claim 38 having a carbenicillin concentration within
the range of 50-200 µg/ml.

43. The method according to claim 38 wherein the carbenicillin has a
concentration within the range of 75-125 µg/ml.

44. The method according to claim 38 wherein the carbenicillin has a
concentration of 100 µg/ml.
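The numeric limits recited in claims 38 and 42 (temperature no greater than 30° C, growth longer than 20 hours, carbenicillin at 50-200 µg/ml) can be collected into a small parameter check. This is a hypothetical sketch for record keeping only: the class and function names are assumptions made for this example, and combining the limits of an independent and a dependent claim in one check is a simplification.

```python
# Minimal sketch: compare planned growth conditions with the claimed ranges.
# All names here are hypothetical and chosen for illustration.
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class GrowthConditions:
    temperature_c: float
    hours: float
    carbenicillin_ug_per_ml: float


def deviations(cond: GrowthConditions) -> List[str]:
    """Return a list of parameters that fall outside the recited ranges."""
    problems = []
    if cond.temperature_c > 30:
        problems.append("temperature above 30 C")
    if cond.hours <= 20:
        problems.append("growth period not longer than 20 hours")
    if not 50 <= cond.carbenicillin_ug_per_ml <= 200:
        problems.append("carbenicillin outside 50-200 ug/ml")
    return problems


# Values taken from dependent claims 40, 41 and 44: 28 C, 36-48 h, 100 ug/ml.
preferred = GrowthConditions(temperature_c=28, hours=40, carbenicillin_ug_per_ml=100)
print(deviations(preferred))
```

An empty list indicates that every parameter lies inside the recited limits.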

45. Antibodies against a tetrodotoxin-sensitive sodium channel protein as 
claimed in claims 15, 16, 32 and 33. 



15 



30 



46. The use of a tetrodotoxin-sensitive sodium channel protein as claimed in 



claims 15, 16, 32 and 33 for identifying inhibitors of their activity. 

47. A method of producing a tetrodotoxin-sensitive sodium channel protein as
claimed in claims 15, 16, 32 and 33, comprising cultivating a host cell as claimed in claim 12
or 29 in a suitable medium and optionally isolating said channel protein.

48. Proteins as claimed in claims 15, 16, 32 and 33 prepared by the method of
claim 47.

49. Proteins as claimed in claims 15, 16, 32 and 33 for identifying inhibitors of
their activity.

50. The invention substantially as hereinbefore described, especially with 
reference to the examples. 


*** 
Fig. 1A: SEQ ID NO:3

1 MAARLLAPPG PDSFXPFTPS SLAM I ERR I A E S KLKKP P KA DG SHREDDED 

51 SXPKPNSDLS AGXSLPFIYG DIPQGLVAVP LZDFDPYYLT QKTFVVLNRG 

101 KTLFRFSAT? ALYIL5PFNL IRRIAIKILI HSVFSMIIMC TIL/TNCVFMT 

I- ISl 1 

151 FSNPPEWSKN VZYTFTGIYT FESLVKIXAR GFCIDGFTFL RDPWHWLDFS 

I" IS2 1 |__ 

201 VIMMAYVTE? VDLGNVSALR TFRVLRALKT ISVIPGLKTI VGAL.IQSVKK 
IS3 1 ♦ | IS4 , 

2 51 LS D VM I LTVF CLSVFALIGL QLFMGNLRNK CWWPINFNE SYLENGTRGF 

•I- "5 1 ♦ ♦ 

3 01 DWEEYINNXT NFYMVPGMLE PLLCGNSSDA GQCPEGFQCM KAGRNPNYGY 

♦ ♦ 
351 TSFDTFSWAF LALFRLMTQD YWENL. YQ LTL RAAGKTYMIF FVLVIFVGSF 

A | IS6- 

4 01 YLVNLILAW AMAYEEQNQA TLEEAEQKEA EFKAMLEQLK KQQEEAQAAA 

I 

4 51 MAT SAG TVS E DAIEEEGEDG VGSPRSSSEL SKLSSKSAXE RRNRRXXRKQ 

501 K2LSSGZSKG DPEKVFKSES EDGMRRKAFR LPDNRIGRXF SIMNQSLLSI 

551 PGSPFLSRHN SKSSIFSFRG PGRFRDPGSE NEFADDEHST VEESEGRRDS 

SOI LFIPIRARER RSSYSGYSGY SQCSRSSRIF PSLRRSVKRN STVDCNGWS 

S51 LIGPGSHIGR LLPEATTEVS IKXXGPGSLL VSMDQLASYG RKDRINS IMS 

701 WTNTLVEEt EESQRKCPPC WYKFANTFLI WECHPYWIKL KHIVNLIVMD 

| 

7S1 PFVDLAITIC IVLNTLFMAM EHHPMTPQFE HVLAVGNLVF TG I FTAEMFL 



IIS1 —I I IIS2- 



801 KLIAMDPYYY FQEGWNIFDG FIVSLSLMSL SLADVEGLSV LRS FRLLRVF 
! i --IIS3 | | IIS4 
Fig. 1B: SEQ ID NO:3

351 KLAKSWPTLN MLIXIIGNSV GALGNLTLVL AIIVFIFAW GMQLFGXSYX 
1 ♦ | uss 1 

901 ZCVCKINQEC XLPRWHMNDF FHSFL.IVFRV LCGEWIETMW DCMSVAGQAM 

t- 

951 CLIVFMMVMV IGMLVVLNLF LALLLSSFSA DNLAATDDDG EMNNLQISVI 
IIS5 1 

1001 RIKKGVAWTK VKVHAFMQAH FKQREADEVX PLDELYEKKA NCIANHTGVD 

♦ 

1051 IHRNGDFQKN GNGTTSGIGS SVEKVIIDED KMSFINNPNL TVRVPIAVGE 

♦ ♦ 
1101 SDFENLNTED VSSESDPEGS KDKLDDTSSS EGSTIDIKPE VEEVPVEQPE 

1151 EYUDPDACFT EGCVQRFKCC QVNIEEGLGK SWWILRKTC? LIVEHNWFET 

| 

1201 FIIFMILLSS GAliAFEDIYI EQRKTTRTTL EYADKVFTYI F I LEMLLKWT 
IIIS1 | | IIIS2 

1251 AYGFVXFFTN AWCWLDFLIV AVSLVSLIAN AZ.GYSELGAI KSLRTLRALR 
--| | IIIS3 | | --IIIS4- 

13 01 PLRALSRFEG MRVWNALVG AIPSIMNVLL VCLIFWLIFS IMGVNLr AGK 

1 I hiss 1 

1331 YKYCFNETSE IRFEIDIVNN KTDCEKLMEG NS7EIRWKNV KINFDNVGAG 

♦ ♦ ♦ 

1401 YLALLQVATF KGWMDIMYAA VDSRKPDEQ? DYEGNIYMYI YFVIFIIFGS 

|- IIISo 

14 51 FFTLNLFIGV IIDNFNQQKX KFGGQDIFMT SEQKKYYNAM KXjLGS KK? Q X 

1 

1501 PIPRPLNKIQ GIVFDFVTCQ AFDIVIMMLI CLNMVTMMVE TDTQSKQMEN 



1551 ILYWINLVFV IFFTCECVLK MFALRHYYFT IGWNIFDFW VILSIVGMFL 
| IVS2 I I IVS3-- 

15 01 ADIIEKYFVS ?TLF?,VIRI^ RXGRILRLIK GAXGIRTLLF ALMMSLPALF 
1 1" IVS4 | 

1SS1 N'IGLLLFLVrt FIFSIFGMSN FAYVKHEAGX DDMFNFETFG N3MICLFQIT 
! IVS5 1 
Fig. 1C: SEQ ID NO:3

1701 TSAGWDGLLL PILNRPPDCS LDKEKPGSGF XGDCGNPSVG IFFFVSYIII 

| 

1751 SFLIWNMYI AIILENFSVA TEESADPLSS DDFETFYEIW EKFDPDATQF 
IVSS 1 ♦ 

1301 IEYCKLADFA DALEKPLRVP KPNTIELIAM DLPMVSGDRI KCLDILFAFT 

1851 KRVLGDSGEL DILRQQMEER FVASNPSKVS YEPITTTLRR KQEEVSAWL 

1901 QRAYRGHLAR RGFICRKMAS NKLENGGTKR DKXSSTPSTA SLPSYDSVTK 

1951 PDKZKQQRAS EGRRERAKRQ KEVRESKC 
Fig. 2A: SEQ ID NO:4

I MAARLLA? PG PDSFKPFTPE SLANIZRRIA ESXLXXPPXA DGSHREDDED 
51 SKPKPNSDLE AGKSLPFIYG DIPQGLVAVP LZDFDPYYLT Q KT FWLNRG 
101 KTLFRFSAT? ALYILSPFNL IRRIAIKILI KSVFSMIIMC TILTNCVFMT 

I- - — isi ! 

151 FSNPPEWSXN VEYTFTGIYT FZSLVKIIAR GFCIDGFTFL, 3JDPWNWLDFS 



201 VIMMAYVTEF VDLGNVSALR T FR.VLRALKT ISVIPGLKTI VGALIQSVKX 

IS3 1 ♦ I * S4 1 

251 LSDVMILTVF CLSVFALIGL Q L FMGNLRNK CWWPINFNZ SYLENGTRGF 

•| IS5 1 ♦ ♦ 

3 01 DWEZYINNKT NFYMVPGMLE PLLCGNSSDA GQCPEGFQCM KAGRNPNYGY 
♦ ♦ 

3 51 TSFDTFSWAF LALFRLMTQD YWENLYQLTL RAAGKTYMIF FVLVIFVGSF 

A | ISS 

401 YU/NLILAW AMAYEEQNQA TLEEAEQKZA EFKAMLEQLK KQQEEAQAAA 
1 

4 51 MATS AG TVS E DAIEEEGEDG VGSPRSSSEL SKLSSXSAX2 RRNRRKXRKQ 

5 01 XELSZGEE2CG DPExO/FKSES EDGMRRKAFR LPDNRIGRXF SIMNQSLLSI 

5 51 PGSPFLSRHN SKSSIFSFRG PGRFRDPGSE NEFADDEHST VEESEGRRDS 
601 LFIPIRARER RSSYSGYSGY SQCSRSSRIF PSLRRSVKRN STVDCNGWS 

6 51 LIGPGSHIGR LLPEVXIDKA ATDS ATTEVE IKXXGPGSLL VSMDQLASYG 

701 RXDRINSIMS WTNT LVE EL EESQRKCPPC WYKFANTFLI WECKPYWIKL 

751 X2IVNLIVMD PFVDLAITIC IVLNTLFMAM EHHPMTPQFS HVLA VGNI*V F 
| IIS1 1 | 

801 TGIFTAEMFL iCLIAMDPYYY FQEGWNIFDG FIVSLSLMEL SLADVEGLSV 
IIS2 --| | --IIS3 | | 
Fig. 2B: SEQ ID NO:4

8 51 LRSFRLLRVF KLAXSWPTLN MLIKIIGNSV GALGNLTLVL AIIVFIFAW 



l *| IIS5- 

901 GMQLFGXSYX ECVCXINQEC XLPRWHMNDF FKSFLIVFRV LCGSWIETMW 



951 DCMSVAGQAM CLIVrMMVMV I GNLWLNL F LALLLSSFSA DNLAATDDDG 

I -«ss -| 

1001 EMNNLQISVI RIXXGVAWTK VXVHAFMQAH FXQREADEVK PLDELYEXXA 

10 51 NC I ANHTGVD IHRNGDFQKN GNGTTSGIGS SV2XYIIDED HMSFINNPNL 

1101 TVRVPIAVGS SDFENLNTED VSSSSDPEGS XDXLDDTSSS EGSTIDIKPE 

1151 VEEVPVEQPS EYLDPDACFT EGCVQRFXCC QVNIESGLGX SWWILRKTCF 

1201 LIVSHNWFET FIIFMILLSS GALAFEDIYI EQRKTIRTIL EYADXVFTYI 

I IIIS1 1 |.._ 

1251 FILEMLLXWT AYGFVXFFTN AWCWLDFLIV AVSLVSLIAN ALG YS ELGAI 
IIIS2 "-- I I" IIIS3 — I I-- 



1301 KSLRTLRALR PLRALSRFEG MRVWNALVG AIPSIMNVLL VCLIFWLIFS 

— 11134 "I I HISS 

1351 IMGVNLFAGX YHYCFNETSS IRFEIDIVNN KTDCEXLMSG NSTEIRWKNV 



♦ ♦ ♦ 



1401 KINFDNVGAG YLALLQVATF XGWMDIMYAA VDSRKPDEQP DYSGNIYMYI 

I — 

1451 YFVIFIIFGS FFTLNLFIGV IIDNFNQQKX KFGGQD I FMT EEQKXYYNAM 

HISS 1 

1S01 XXLGSXXPQX PIPRPLNXIQ GIVFDFVTQQ AFDIVIMMLI CLNMVTMMVE 

I --IVS1 1 

1SS1 TDTQSKQMSN ILYWINLVFV IFFTCECVLK MFALRKYYFT IGWNIFDFW 

I IVS2 | | 

1S01 VILSIVGMFL ADIISXYFVS PTLFRVIRLA RIGRILRLIX GAXGIRTLLF 



-- IVS3 1 I IVS4- 



1SS1 ALMMSLPALF NIGLLLFLVM FIFSIFGMSN FAYVXHSAGI DDMFNFETFG 

I IVSS I 
Fig. 2C: SEQ ID NO:4

1701 NSMICLFQIT TSAGWDGLLL PILNP.PPDCS LDKZKPGSG? KGDCGNPSVG 

I- 

1731 IFFFVSYIII S7LIWNMYI AIILENFSVA TEESADPLS2 DDFETFYSIW 

IV ss — !♦ 

13 01 EKFDPDATQF IEYCKLADFA DALEKPLRVP KPNTIELIAM DLPMVSGDRI 
13 51 KCLDILFAFT KRVLGDSGEL DILRQQMEER FVASNPSKVS YEPITTTLRR 

1901 :-;qesvsawl QRAYRGKLAR rgficrkmas nxlenggthr DKKESTPSTA 

1951 SLPSYDSVTK PDKEXQQRAE EGRRERAKRQ KEVRESKC 
Fig. 3A: NaCh6/PN4 alignment (SEQ ID NO:7)



RATNaCh6A
rPN4       CCAAGATGGC GCCCACCGCA G7CCCGCCCG CCGCAGCCTC GGCGCCTCTG  50

RATNaCh6A
rPN4       CAGTCCGGCC GCGCCTCCCG GGCCCCZCGC TAGGGCCGCT GCCGCCTCGC  100

RATNaCh6A
rPN4       CCGCCGCCGC CGCCGCCAGC TGACCTGTCC CGGACACATA ACTAACGAAG  150

                      start=>
RATNaCh6A             AXGAGAAGAT --CGGCGCGG CTGCTCGCAC CACCAGGCCC  38
rPN4       CTGCTGCAGG ATGAGAAGAI 5GCAGCGCGG CTGCTCGCAC CACCAGGCCC  200
                      start=>

RATNaCh6A  TGATAGTTTC AAGCCTTTCA CCCCTGAGTC GCTGGCAAAC ATCGAGAGGC  88
rPN4       TGATAGTTTC AAGCCTTTCA CCCCTGAGTC ri r* tci rzr^ Air* *\ iLGAGAGG^  250

RATNaCh6A  GTATTGCCGA GAGCAAGCTC AAGAAACCAC CAAAGGCGGA TGGCAGCCAC  138
rPN4       GTATTGCCGA GAGCAAGCTC AAGAAACCAC c A a a czrz r*n rz a I vjULAGCCAC  300

RATNaCh6A  CGGGAGGACG ATGAAGACAG CAAGCCCAAG CCAAACAGTG ACCTGGAGGC  188
rPN4       CGGGAGGACG ATGAAGACAG CAAGCCCAAG            ALL i GvjAGGC  350

RATNaCh6A  TGGGAAGAGT TTGCCTTTCA TCTACGGGGA CATCCCGCAA GGCCTGGTTG  238
rPN4       TGGGAAGAGT TTGCCTTTCA TCTACGGGGA            ubLLTGGTTG  400

RATNaCh6A  CGGTTCCCCT GGAGGACTTT GACCCTTACT ATTTGACGCA GAAAACCTTT  288
rPN4       CGGTTCCCCT GGAGGACTTT GACCCTTACT «v x x 1 wn^VJUn             450

RATNaCh6A  GTAGTATTAA ACAGAGGGAA AACTCTCTTC AGATTTAGTG CCACACCTGC  338
rPN4       GTAGTATTAA ACAGAGGGAA AACTCTCTTC AGATTTAGTG             500

RATNaCh6A  CTTGTACATT TTAAGCCCTT TTAACCTGAT AAGAAGAATA GCTATTAAAA  388
rPN4       CTTGTACATT TTAAGCCCTT TTAACCTGAT AAGAAGAATA GCTATT A A A £  550

RATNaCh6A  TTTTGATACA CTCAGTTTTC AGCATGATCA TCATGTGCAC CATCCTGACC  438
rPN4       TTTTGATACA CTCAGTTTTC AGCATGATCA TCATGTGCAC CATCCTGACC  600

RATNaCh6A  AACTGTGTGT TCATGACCTT TAGTAACCCT CCAGAATGGT CCAAGAATGT  488
rPN4       AACTGTGTGT TCATGACCTT TAGTAACCCT CCAGAATGGT CCAAGAATGT  650

RATNaCh6A  GGAGTACACA TTCACAGGGA TTTACACATT TGAATCACTA GTGAAAATCA  538
rPN4       GGAGTACACA TTCACAGGGA TTTACACATT TGAATCACTA GTGAAAATCA  700

RATNaCh6A  TCGCAAGAGG TTTCTGCATA GACGGCTTCA CCTTCTTACG AGACCCGTGG  588
rPN4       TCGCAAGAGG TTTCTGCATA GACGGCTTCA CCTTCTTGCG AGACCCGTGG  750

RATNaCh6A  AACTGGTTAG ACTTCAGTGT CATCATGATG GCATATGTGA CAGAGTTTGT  638
rPN4       AACTGGTTAG ACTTCAGTGT CATCATGATG GCATATGTGA CAGAGTTTGT  800

RATNaCh6A  GGACCTGGGC AATGTCTCAG CGCTGAGAAC ATTCAGGGTT CTCCGAGCTT  688
rPN4       GGACCTGGGC AATGTCTCAG CGCTGAGAAC ATTCAGGGTT CTCCGAGCTT  850
Fig. 3B: NaCh6/PN4 alignment (SEQ ID NO:7)



RATNaChSA TGAAAACTAT CTCTGTAATT CCAGGCCTGA AGACAATCGT GGGCGCCCTA 73 8 

r?M4 . TGAAAACTAT CTCTGTAATT CCAGGCCTGA AGACAATCGT GGGCGCCCTA * 900 

RATNaChSA ATCCAGTCCG TG AAG AAG CT GTCGGACGTG ATGATCCTGA CAGTGTTCTG 78 8 

r?N4 ATCCAGTCCG TG AAG AAG CT GTCGGACGTG ATGATCCTGA CAGTGTTCTG 950 

RATNaChSA CCTGAGTGTT TTCGCCCTGA TTGGCCTGCA GCTCTTTCAT GGGAACCTTT 83 8 

rPN4 CCTGAGTGTT TTCGCCCTGA TTGGCCTGCA GCTCTTCATG GGGAACCTT - 999 

RATNaChSA CGAAAC-AGT GTGTCGTGTG GCCCATAAAC TTCAACGAGA GCTACCTGGA 8 87 

2T-PN-4 CGAAACAAGT GTGTCGTGTG GCCCATAAAC TTCAACGAGA GCTACCTGGA 1043 

RATNaChSA GAACGGCACC AGAGGCTTTG ACTGGGAGGA ATATATCAAC AATAAAACAA 93 7 

rPN4 GAACGGCACC AGAGGCTTTG ACTGGGAGGA ATATATCAAC AATAAAACAA 1099 

RATNaChSA ACTTTTACAT GGTTCCTGGC ATGCTAGAAC CCTTGCTCTG CGGGAACAGT 98 7 

rPN4 ACTTTTACAT GGTTCCTGGC ATGCTAGAAC CCTTGCTCTG CGGGAACAGT 114 9 



RATNaChSA TCTGATGCTG GGCAATGC-- -GAAGGATTC CAGTGCAGTA AAGCAGGAAG 1034 
rPN4 TCTGATGCTG GGCAATGCCC AGAGGGATTC CAGTGCATGA AAGCAGGAAG 1199 



RATNaChSA GAACCCCAAC TACGGTTACA CCAGCTTTGA CACCTTCAGC TGGGCCTTCT 1084 

r?N4 GAACCCCAAC TACGGTTACA CCAGCTTTGA CACCTTCAGC TGGGCCTTCT 124 9 

RATNaChSA TGGCATTATT CCGCCTTATG ACCCAGGACT ATTGGGAGAA CTTATAC C AG 113 4 

rPN4 TGGCATTATT CCGCCTTATG ACCCAGGACT ATTGGGAGAA CTTATAC C AG 12 9 9 

RATNaChSA CTGACCTTAC GAGCCGCTGG GAAAACGTAC ATGATCTTCT TTGTCTTGGT 118*4 

rPN4 CTGACCTTAC GAGCCGCTGG GAAAACGTAC ATGATCTTCT TTGTCTTGGT 134 9 

RATNaChSA CATCTTCGTG GGTTCTTTCT ATCCGGTGAA CTTGATCTTG GCTGTGGTGG 1234 

rPN4 CATCTTCGTG GGTTCTTTCT ATCTGGTGAA CTTGATCTTG GCTGTGGTGG 13 99 

RATNaChSA CCATGGCTTA TGAGGAACAG AACCAGGCAA CACTGGAGGA GGCAG AG C AA 1284 

r?N4 CCATGGCTTA TGAGGAACAG AACCAGGCAA CACTGGAGGA GGCAGAGCAA 144 9 

RATNaChSA AAAGAGGCCG AGTTCAAGGC AATG CTGG AG CAACTCAAGA AGCAGCAGGA 13 34 

r?N4 AAAGAGGCCG AGTTCAAGGC AATG CTGG AG CAACTCAAGA AGCAGCAGGA 14 9 9 

RATNaChSA GGAGGCACAG GCTGCTGCAA TGGCCACCTC AGCGGGCACT GTCTCGGAAG 1384 

rPN4 GGAGGCACAG GCTGCTGCAA TGGCCACCTC AGCGGGCACT GTCTCGGAAG 154 9 

RATNaChSA ACGCCATTGA AGAAGAAGGG GAAGATGGGG TAGGCTCTCC GAGGAGCTCT 143 4 

rPM4 ACGCCATTGA AGAAGAAGGG GAAGATGGGG TAGGCTCTCC GAGGAGCTCT 15 99 

RATNaChSA TCTGA^CTGT CTAAACTCAG TTCCAAGAGC GCGAAGGAGC GGCGGAACCG 1484 

rPN4 T CTG AACTGT CTAAACTCAG TTCCAAGAGC GCGAAGGAGC GGCGGAACCG 164 9 

RATNaChSA ACGGAAGAAG AGGAAGCAGA AGGAGCTCTC TGAAGGCGAG GAGAAAGGGG 15 3 4 

r?N4 ACGGAAGAAG AGGAAGCAGA AGGAGCTCTC TGAAGGCGAG GAGAAAGGGG 15 9 9 
Fig. 3C: NaCh6/PN4 alignment (SEQ ID NO:7)



RATNaCh6A  ACCCGGAGAA GGTGTTTAAG TCAGAGTCGG AATACGGTAT GAGAAGGAAG  1584
rPN4                  GGTGTTTAAG TCAGAGTCGG AAGACGGTAT GAGAAGGAAG  1749

RATNaCh6A  GCCTTCCGGC TGCCAGACAA CAGGATAGGG AGGAAGTTTT CCATCATGAA  1634
rPN4       GCCTTCCGGC TGCCAGACAA CAGGATAGGG AGGAAGTTTT CCATCATGAA  1799

RATNaCh6A  TCAGTCGCTG CTCAGCATTC CAGGCTCGCC CTTCCTCTCC CGACATAACA  1684
rPN4       TCAGTCGCTG CTCAGCATTC CAGGCTCGCC CTTCCTCTCC CGACATAACA  1849

RATNaCh6A  GCAAAAGCAG CATCTTCAGC TTC-GGGGAC CC-GTCGGTT -CGGGACCCC  1731
rPN4       GCAAAAGCAG CATCTTCAGC TTCCGGGGAC CCGGTCGGTT CCGGGACCCC  1899

RATNaCh6A  GGCTCTGAGA ATGAGTTCGC AGACGATGAA CACAGCACCG TGGAGGAGAG  1781
rPN4       GGCTCTGAGA ATGAGTTCGC AGACGATGAA CACAGCACCG TGGAGGAGAG  1949

RATNaCh6A  CGAGGGCCGG CGTGACTCGC TCTTCATCCC GATCCGCGCC CGCGAGCGCC  1831
rPN4       CGAGGGCCGG CGTGACTCGC TCTTCATCCC GATCCGCGCC CGCGAGCGCC  1999

RATNaCh6A  GCAGCAGCTA CAGTGGCTAC AGCGGCTACA GCCAGTGCAG CCGCTCGTCG  1881
rPN4       GCAGCAGCTA CAGTGGCTAC AGCGGCTACA GCCAGTGCAG CCGCTCGTCG  2049

RATNaCh6A  CGCATCT-CC CCAGCCTGC- GCGCAGCGTG AAGC-CAACA GCACGGTGGA  1928
rPN4       CGCATCTTCC CCAGCCTGCG GCGCAGCGTG AAGCGCAACA GCACGGTGGA  2099

RATNaCh6A  CTGCAACGGC GTAGTGTCAC TCATCGGGCC CGGCTCACAC ATCGGGCGGC  1978
rPN4       CTGCAACGGC GTAGTGTCAC TCATCGGGCC CGGCTCACAC ATCGGGCGGC  2149

RATNaCh6A  TCCTGC-TGA GGCAACGACT GAGGTGGAAA TTAAGAAGAA AGGCCCTGGA  2027
rPN4       TCCTGCCTGA GGCAACGACT GAGGTGGAAA TTAAGAAGAA AGGCCCTGGA  2199

RATNaCh6A  -CTCTTTTAG TTTCTATGGA CCAACTCGCC TCCTACGGAC GGAAGGACAG  2076
rPN4       TCTCTTTTAG TTTCTATGGA CCAACTCGCC TCCTACGGAC GGAAGGACAG  2249

RATNaCh6A  AATCAACAGC ATAATGAGCG TGGTCACAAA CACGCTAGT- GAAGAGCTGG  2125
rPN4       AATCAACAGC ATAATGAGCG TGGTCACAAA CACGCTAGTG GAAGAGCTGG  2299

RATNaCh6A  AAGAGTCTCA GAGAAAGTGC CCACCGTGCT GGTATAAGTT TGCCAACACT  2175
rPN4       AAGAGTCTCA GAGAAAGTGC CCACCGTGCT GGTATAAGTT TGCCAACACT  2349

RATNaCh6A  TTCCTCATCT GGGAGTGTCA CCCCTACTGG ATAAAACTGA AGGAGATCGT  2225
rPN4       TTCCTCATCT GGGAGTGTCA CCCCTACTGG ATAAAACTGA AGGAGATCGT  2399

RATNaCh6A  GAACTTAATC GTCATGGACC CTTTTGTAGA CTTAGCCATC ACCATCTGCA  2275
rPN4       GAACTTAATC GTCATGGACC CTTTTGTAGA CTTAGCCATC ACCATCTGCA  2449

RATNaCh6A  TCGTTCTGAA TACGCTATTT ATGGCAATGG AGCACCATCC CATGACACCA  2325
rPN4       TCGTTCTGAA TACGCTATTT ATGGCAATGG AGCACCATCC CATGACACCA  2499

RATNaCh6A  CAGTTCGAAC ACGTCTTGGC CGTAGGAAAT CTGGTGTTCA CCGGGATCTT  2375
rPN4       CAGTTCGAAC ACGTCTTGGC CGTAGGAAAT CTGGTGTTCA CCGGGATCTT  2549

RATNaCh6A  CACGGCGGAA ATGTTTCTGA AGCTCATAGC CATGGACCCC TACTATTATT  2425
rPN4       CACGGCGGAA ATGTTTCTGA AGCTCATAGC CATGGACCCC TACTATTATT  2599
Fig. 3D: NaCh6/PN4 alignment (SEQ ID NO:7)



RATNaChSA TCCAAGAAGG CTGGAACATT TTTGACGGAT TTATTGTCTC CCTCAGTTTA 24 7 5 

rPM4 TCCAAGAAGG CTGGAACATT TTTGACGGAT TTATTGTCTC CCTCAGTTTA 264 9 

RATNaChSA ATGGAGCTGA GTCTCGCAGA TGTGGAGGGG CTCTCAGTGC TGCGGTCTTT 2 525 

rPN4 ATGGAGCTGA GTCTCGCAGA TGTGGAGGGG CTCTCAGTGC TGCGGTCTTT 26 99 

RATNaChSA CCGACTGCTC CGAGTCTTCA AGCTGGCCAA GTCCTGGCCC ACCCTGAACA 2 57 5 

r?N4 CCGACTGCTC CGAGTCTTCA AGCTGGCCAA GTCCTGGCCC ACCCTGAACA 2749 

RATNaChSA TGCTGATCAA GATCATCGGG AACTCCGTGG GTGCCCTGGG CAACCTGACC 262 5 

rPN4 TGCTGATCAA GATCATCGGG AACTCCGTGG GTGCCCTGGG CAACCTGACC 279 9 

RATNaChSA CTGGTGCTGG CCATCATCGT CTTCATCTTC GCCGTGG-TGG GGATGCAGCT 26 7 5 

r?M4 CTGGTGCTGG CCATCATCGT CTTCATCTTC GCCGTGGTGG GGATGCAGCT 284 9 

RATNaChSA GTTTGGAAAG AGTTACAAGG AGTGCGTCTG TAAGATCAAC CAGGAGTGCA 2725 

r?N4 GTTTGGAAAG AGTTACAAGG AGTGCGTCTG TAAGATCAAC CAGGAGTGCA 28 99 

RATNaChSA AGCTCCCGCG CTGGCACATG AACGACTTCT TCCACTCCTT CCTCATCGTC 2 77 5 

rPN4 AGCTCCCGCG CTGGCACATG AACGACTTCT TCCACTCCTT CCTCATCGTC 294 9 

RATNaChSA TTCCGAGTGC TGTGTGGGGA GTGGATCGAG ACCATGTGGG ACTGCATGGA 282S 

rPN4 TTCCGAGTGC TGTGTGGGGA GTGGATCGAG ACCATGTGGG ACTGCATGGA 2999 

RATNaCh6A GGTGGCCGGC CAGGCCATGT GCCTCATTGT CTTCATGATG GTTATGGTCA 2875 

rPN4 GGTGGCCGGC CAGGCCATGT GCCTCATTGT CTTCATGATG GTTATGGTCA 304 9 

RATNaChSA TTGGCAACCT GGTGGTGCTG AATCTATTCC TGGCCTTGCT TCTGAGCTCC 2 92 5 

TPM4 TTGGCAACCT GGTGGTGCTG AATCTATTCC TGGCCTTGCT TCTGAGCTCC 309 9 

RATNaChSA TTCAGCGCAG ACAACCTGGC GGCCACAGAC GACGACGGGG AAATGAACAA 2 97 5 

rPM4 TTCAGCGCAG ACAACCTGGC GGCCACAGAC GACGACGGGG AAATGAACAA 314 9 

RATNaChSA CCTGCAGATC TCAGTGATCC GGATCAAGAA GGGCGTGGCC TGGACCAAAG 3 02 5 

rPN4 CCTGCAGATC TCAGTGATCC GGATCAAGAA GGGCGTGGCC TGGACCAAAG 3199 

RATNaCh6A TGAAGGTGCA CGCCTTCATG CAGGCTCACT TCAAGCAGCG GGAGGCGGAT 3 07 5 

rPN4 TGAAGGTGCA CGCCTTCATG CAGGCTCACT TCAAGCAGCG GGAGGCGGAT 3 24 9 

RATNaChSA GAAGTGAAAC CCCTCGACGA GCTGTATGAG AAGAAGGCCA ACTGCATCGC 312 5 

rPN4 GAAGTGAAAC CCCTCGACGA GCTGTATGAG AAGAAGGCCA ACTGCATCGC 3 299 

RATNaChSA CAACCACACG GGCGTGGATA TCCACCGGAA CGGCGACTTC CAGAAGAACG 317 5 

rPN4 CAACCACACG GGCGTGGATA TCCACCGGAA CGGCGACTTC CAGAAGAACG 3 34 9 

RATNaChSA GGAACGGAAC CACCAGCGGC ATCGGCAGCA GCGTGGAGAA GTACATCATC 3 22 5 

r?N4 GGAACGGAAC CACCAGCGGC ATCGGCAGCA GCGTGGAGAA GTACATCATC 3 3 99 

RATNaChSA GACGAGGACC ACATGTCCTT CATTAACAAC CCAAACCTGA CCGTCCGGGT 3 275 

rPN4 GACGAGGACC ACATGTCCTT CATTAACAAC CCAAACCTGA CCGTCCGGGT 344 9 

RATNaChSA GCCCATTGCT GTGGGCGAGT CTGACTTCGA GAACCTCAAC ACAGAGGATG 3 32 5 

rPN4 GCCCATTGCT GTGGGCGAGT CTGACTTCGA GAACCTCAAC ACAGAGGATG 3499 
Fig. 3E: NaCh6/PN4 alignment (SEQ ID NO:7)



RATNaCh6A  I^^ GCGA ATCAGACC - T GAAGGCAGCA AAGA7AAACT GGACGATACC  3375
rPN4       - xAGCAGCGA ATCAGACCCT GAAGGCAGCA AAGATAAACT GGACGATACC  3549

RATNaCh6A  AGCTCCTCAG AAGGAAGTAC CATCGACATC AAGCCTGAGG TGGAAGAAG™  3425
rPN4       AGCTCCTCAG AAGGAAGTAC CATCGACATC AAGCCTGAGG TGGAAGAAGT  3599

RATNaCh6A  ;X^23 GGAG CAA< =CTGAGG AATAC7TGGA TCCGGACGCC TGCTTTACAG  3475
rPN4       T.CCuxGuAG CAACCTGAGG AATACTTGGA TCCGGACGCC TGCTTTACAG  3649

RATNaCh6A  AGGGTTGCGT CCAGCGGTTC AAGTGCTGCC AGGTCAACAT CGAGGAAGGA  3525
rPN4       ^G^GTTGCGT CCAGCGGTTC AAGTGCTGCC AGGTCAACAT CGAGGAAGGA  3699

RATNaCh6A  G I^ GGGAAGT C ^TGGTGGAT CTTGCGGAAA ACCTGCTTCC TCATTGTGGA  3575
rPN4       Cx.-.GGv_AAGT CGTGGTGGAT CTTGCGGAAA ACCTGCTTCC TCATTGTGGA  3749

RATNaCh6A  I rTGAGACCT TCATCA ^TT CATGATTCTG CTCAGCAGTG  3625
rPN4       GCACAATTGG TTTGAGACCT TCATCATCTT CATGATTCTG CTCAGCAGTG  3799

RATNaCh6A  G ^^" GGC CTTTGAGGAC ATCTACATTG AGCAGAGGAA GACCATCCGC  3675
rPN4       GCGCCCTGGC CTTTGAGGAC ATCTACATTG AGCAGAGGAA GACCATCCGC  3849

RATNaCh6A  ACCATCCTGG AGTATGCGGA CAAGGTCTTC ACCTACATCT TCATCCTGGA  3725
rPN4       ACCATCCTGG AGTATGCGGA CAAGGTCTTC ACCTACATCT TCATCCTGGA  3899

RATNaCh6A  GATGTTGCTC AAGTGGACCA CGTACGGCTT CGTCAAG^TC TTCACCAATG  3775
rPN4       GATGTTGCTC AAGTGGACAG CCTACGGCTT CGTCAAGTTC TTCACCAATG  3949

RATNaCh6A  CCTGGTGCTG GTTGGACTTC CTCATTGTGG CTGTCTCTT^ AGTCAGCC' r * r  3825
rPN4       CCTGGTGCTG GTTGGACTTC CTCATTGTGG CTGTCTCTTT AGTCAGCCTT  3999

RATNaCh6A  ATAGCTAATG CCCTGGGCTA CTCGGAACTA GGTGCCATAA AGTCCCTTAG  3875
rPN4       ~TAGCTAA7G CCCTGGGCTA CTCGGAACTA GGTGCCATAA AGTCCCTTAG  4049

RATNaCh6A  GAGGGTAAGA GCTTTGAGAC CCTTAAGAGC CTTATCACGA TTTGAAGGGA  3925
rPN4       G^CCCTAAGA GCTTTGAGAC CCTTAAGAGC CTTATCACGA TTTGAAGGGA  4099

RATNaCh6A  TGAGGGTGGT GGTGAATGCC TTGGTGGGTG CCATCCCC^C CATCATGAA^  3975
rPN4       TGAGGGTGGT GGTGAATGCC TTGGTGGGCG CCATCCCCTC CATCATGAAT  4149

RATNaCh6A  G 3 GG I GCTGG -GTGTCTCAT CTTCTGGCTG ATTTTCAGCA TCATCGGAG^  4025
rPN4       G J.GCTGCTGG TGTGTCTCAT CTTCTGGCTG ATTTTCAGCA TCATGGGAGT  4199

RATNaCh6A  7AACCTGTT7 GCGGGGAAAT ACCACTACTG CTTTAATGAG ACTTCTGAAA  4075
rPN4       iAACCTGTTT GCGGGGAAAT ACCACTACTG CTTTAATGAG ACTTCTGAAA  4249

RATNaCh6A  Z GGGGTTCGA ~"CGATATT GTCAACAATA AAACGGACTG TGAGAAGCT-  4125
rPN4       iCCGGTTCGA AA7CGATATT GTCAACAATA AAACGGACTG TGAGAAGCTC  4299

RATNaCh6A  ATGGAGGGCA ACAGCACGGA GATCCGATGG AAGAATGTCA AG^TC^ACT^  4175
rPN4       ATGGAGGGCA ACAGCACGGA GATCCGATGG AAGAATGTCA AGATCAACTT  4349

RATNaCh6A  TGACAATGTC GGAGCAGGGT ACCTGGCCCT TCTTCAAGTG GCAACCT^C^  4225
rPN4       TGACAATGTC GGAGCAGGGT ACCTGGCCCT TCTTCAAGTG GCAACCTTCA  4399
Fig. 3F: NaCh6/PN4 alignment (SEQ ID NO:7)



RATNaChSA. AAGGCTGGAT GGACATCATG TATGCGGCTG TAGATTCCCG AAAGCCAGAC 4275 

r?N4 AAGGCTGGAT GGACATCATG TATGCGGCTG TAGATTCCCG AAAGCCAGAC 444 9 

RATNaChSA GAGCAGCCTG ACTACGAGGG CAACATCTAC ATGT AC AT C T ACTTCGTCAT 4 325 

r?N4 GAGCAGCCTG ACTACGAGGG CAACATCTAC ATGTACATCT ACTTCGTCAT 44 99 

RATNaChSA CTTCATCATC TTCGGCTCCT TCTTCACCCT CAACCTGTTC ATCGGTGTCA 4375 

r?N4 CTTCATCATC TTCGGCTCCT TCTTCACCCT CAACCTGTTC ATCGGTGTCA 4 54 9 

RATNaChSA TCATCGACAA CTTCAACCAG CAGAAGAAAA AGTTTGGAGG TCAGGACATC 4425 

rPN4 TCATCGACAA CTTCAACCAG CAGAAGAAAA AGTTTGGAGG TCAGGACATC 4 599 

RATNaChSA TTCATGACAG AG G AAC AG AA G AAG TACT AT AATGCCATGA AAAAGCTGGG -475 

r?N4 TTCATGACAG AGGAACAGAA GAAGTACTAC AATGCCATGA AAAAGCTGGG 4o4 9 

RATNaCh6A CTCCAAGAAG CCACAGAAGC CCATCCCCCG ACCCTTGAAC AAAATCCAAG 4 525 

r?N4 CTCCAAGAAG CCACAGAAGC CCATCCCCCG ACCCTTGAAC AAAATCCAAG 46 99 

RATNaChSA GGATTGTCTT TGATTTCGTC ACTCAACAAG CCTTTGACAT TGTGATCATG 4 575 

rPN4 GGATTGTCTT TGATTTCGTC ACTCAACAAG CCTTTGACAT TGTGATCATG 4 74 9 

RATNaCh6A ATGCTCATCT GCCTTAACAT GGTGACAATG ATGGTGGAGA CAGACACTCA 4625 

rPN4 ATGCTCATCT GCCTTAACAT GGTGACAATG ATGGTGGAGA CAGACACTCA 4799 

RATNaChSA GAGCAAGCAG ATGGAGAACA TTCTTTACTG GATTAATCTG GTCTTTGTCA 467 5 

rPN4 GAGCAAGCAG ATGGAGAACA TTCTTTACTG GATTAATCTG GTCTTTGTCA 4849 

RATNaChSA TCTTCTTCAC CTGCGAGTGT GTGCTCAAAA TGTTTGCCTT GAGACACTAC 4 72 5 

r?N4 TCTTCTTCAC CTGCGAGTGT GTGCTCAAAA TGTTTGCCTT GAGACACTAC 48 99 

RATNaCh6A TACTTCACCA TTGGCTGGAA CATCTTTGAC TTTGTGGTGG TCATCCTCTC 4 77 5 

rPN4 TATTTCACCA TTGGCTGGAA CATCTTTGAC TTTGTGGTGG TCATCCTCTC 4 94-9 

RATNaChSA CATTGTGGGA ATGTTCCTGG CTGATATCAT TGAGAAGTAC TTCGTCTCCC 48 25 

rPN4 CATTGTGGGA ATGTTCCTGG CTGATATCAT TGAGAAGTAC TTCGTCTCCC 4 999 

RATNaChSA CAACCCTATT CCGAGTTATC CGATTGGCCC GTATTGGGCG CATCTTGCGT 4 8 75 

rPN4 CAACCCTATT CCGAGTTATC CGATTGGCCC GTATTGGGCG CATCTTGCGT 504 9 

RATNaChSA CTGATCAAGG GCGCCAAAGG GATCCGCACT CTGCTCTTTG CTCTGATGAT 4 92 5 

rPN4 CTGATCAAGG GCGCCAAAGG GATCCGCACC CTGCTCTTTG CCTTAATGAT 5099 

RATNaChSA GTCGCTGCCC GCCCTGTTCA ACATCGGCCT CCTGCTCTTC CTCGTCATGT 4 97 5 

rPN4 GTCGCTGCCC GCCCTGTTCA ACATCGGCCT CCTGCTCTTC CTCGTCATGT 514 9 

RATNaChSA TCATCTTCTC CATTTTTGGC ATGTCCAACT TCGCATACGT GAAGCACGAG 502 5 

r?i*4 TCATCTTCTC CATTTTTGGC ATGTCCAACT TCGCATACGT GAAGCACGAG 5199 

RATNaChSA GCCGGCATTG ACG AC ATGTT CAACTTCGAG AC ATTTGG C A ACAGCATGAT 507 5 

rPN4 GCCGGCATTG ACG AC ATGTT CAACTTCGAG ACATTTGGCA ACAGCATGAT 524 9 

RATNaChSA CTGTTTGTTC CAGATCACAA CGTCTGCTGG CTGGGATGGC CTGCTGCTGC 512 5 

rPN4 CTGTTTGTTC CAGATCACAA CGTCTGCTGG CTGGGATGGC CTGCTGCTGC 5299 
Fig. 3G: NaCh6/PN4 alignment (SEQ ID NO:7)

RATNaChSA CAATCCTGAA CZZZCCCZCT GACTGCAGCT TGGACAAAGA GCACCCAGGG 5175 

r?N4 CAATCCTGAA CCZCCCCCCT GACTGCAGCT TGGACAAAGA GCACCCAGGG 5349 

RATNaChSA AGTGGCTTCA AAGGGGACTG TGGGAACCCC TCGGTGGGCA TCTTCTTCTT 522S 

r?N4 AGTGGCTTCA AAGGGGACTG TGGGAACCCC TCGGTGGGCA TCTTCTTCTT 5 3 99 

RATNaChSA TGTGAGCTAC AT C ATCATCT CCTTCCTGAT TGTGGTGAAC ATGTGCATCG 5275 

r?N4 TGTGAGCTAC ATCATCATCT CCTTCCTGAT TGTGGTGAAC ATGTACATCG 5449 

RATNaChSA CCATCATCCT GGAGAACTTC AGCGTGGCCA CCGAGGAGAG CGCCGACCCT 5 32 5 

r?N4 CCATCATCCT GGAGAACTTC AGCGTGGCCA CCGAGGAGAG CGCCGACCCT 5499 

RATNaChSA CTGAGTGAGG ATGACTTCGA GACTTTCTAT GAGATCTGGG AGAAGTTTGA 5375 

r?N4 CTGAGTGAGG ATGACTTCGA GACTTTCTAT GAGATCTGGG AGAAGTTTGA 5549 

RATNaChSA CCCAGACGCC AC C CAGTTCA TCGAGTACTG TAAGCTGGCA GACTTTGCCG 5 42 5 

r?N4 CCCAGACGCC ACCCAGTTCA TCGAGTACTG TAAGCTGGCA GACTTTGCCG 5 599 

RATNaChSA ACGCCCTGGA GCACCCGCTC CGAGTACCCA AGCCCAACAC CAT CG AG CT C 54 7 S 

r?N4 ACGCCCTGGA GCACCCGCTC CGAGTACCCA AGCCCAACAC CATCGAGCTC 564 9 

RATNaCh6A ATCGCCATGG ACCTGCCCAT GGTGAGCGGA GATCGCATCC ACTGCTTGGA 5 525 

rPN4 ATCGCCATGG ACCTGCCCAT GGTGAGCGGA GATCGCATCC ACTGCTTGGA 5699 

RATNaChSA CATCCTTTTC GCCTTCACCA AGGCAGTCCT GGG AGACAGT GGGGAGTTGG S57S 

r°N4 CATCCTTTTC GCCTTCACCA AGCGAGTCCT GGGAGACAGT GGGGAGTTGG 5 74 9 

RATNaChSA ACATCCTGCG GCAGCAGATG GAGGAGCGGT TCGTGGCATC CAATCCTTCC 562 5 

r?N4 ACATCCTGCG GCAGCAGATG GAGGAGCGGT TCGTGGCATC CAATCCTTCC 5 79 9 

RATNaChSA AAAGTGTCTT ACGAAGCCTA TCAC-ACCAC TCTG CGGCGC AACGAGGAGG 5674 

r?N4 AAAGTGTCTT ACGA-GCCTA TCACAACCAC TCTG CGGCGC AAGCAGGAGG 5848 

RATNaChSA AGGTGTCTGC AGTGGTCCTG CAGCGTGCCT ACAGGGGACA CTTGGCTAGG 572 4 

rPN4 AGGTGTCTGC AGTGGTCCTG CAGCGTGCCT ACAGGGGACA CTTGGCTAGG 5898 

RATNaChSA CGGGGCTTCA TCTGCAGAAA GATGGCCTCC AACAAGCTGG AGAATGGAGG 5774 

rPN4 CGGGGCTTCA TCTGCAGAAA GATGGCCTCC AACAAGCTGG AGAATGGAGG 5 94 8 

RATNaChSA CACACACAGA GACAAGAAGG AGAGCACCCC GTCCACAGCC TCCCTCCCCT 582 4 

rPN4 CACACACAGA GACAAGAAGG AGAGCACCCC GTCCACAGCC TCCCTCCCCT 5993 

RATNaChSA CTTACGACAG CGTCACAAAG CCAGACAAGG AGAAGCAGCA G CGTGCGG AG 5874 

rPN4 CTTACGACAG CGTCACAAAG CCAGACAAGG AGAAGCAGCA G CGTGCGG AG 604 8 

RATNaChSA G AG G G C AG AA GGGAAAGAGC CAAGAGGCAA AAAGAGGTCA GGGAGTCCAA 5 92 4 

r?N4 GAGGGCAGAA GGGAAAGAGC CAAGAGGCAA AAAGAGGTCA GGGAGTCCAA 6 09 8 

scop 

RATNaChSA GTGCI&SAGG AGGGGAAAGG AAGCTTACCC CGGCTGAACA CTGGCAAGTG 5 97 4 

r?N4 GTGCIAGAGG AGGGGAAAGG AAGCTTACCC CGGCTGAACA CTGGCAAGTG 614 8 

RATNaChSA AAAGCTTGTT T AC AAACTT C CGAATCTCAC GGATGCAGA- CAGCTGTGCA 602 3 

r?N4 AAAGCTTGTT TACAAACTTC CGAATCTCAC GGATGCAGAG CAGCTGTGCA 619 3 
Fig. 3H: NaCh6/PN4 alignment (SEQ ID NO:7)



RATNaCh6A  GACGCTCGCT GTACTGGAAG ACCTATACCA AACATAGTCT GCTTACATGT  6073
rPN4       GACGCTCGCT GTACTGGAAG ACCTATACCA AACATAGTCT GCTTACATGT  6248

RATNaCh6A  GACATGGTGG CATCCTGAGC GGTGACTGCT GCTGGGGACA AAGGACCCTG  6123
rPN4       GACATGGTGG CATCCTGAGC GGTGA   CT GCTGGGGACA AAGGACCCTG  6295

RATNaCh6A  CTCCCTGGAC TCACAGATCT CCTATCGCTT GGGCAGACGG TTACTGCATG  6173
rPN4       CTCCCTGGAC TCACAGATCT CCTATCGCTT GGGCAGACGG TTACTGCATG  6345

RATNaCh6A  TTCCACACTT AGTCAATGCA ACTTAGGACT AAACTAACCA GGATACAAAA  6223
rPN4       TTCCACACTT AGTCAATGCA ACTTAGGACT AAACTAACCA GGATACAAAA  6395

RATNaCh6A  CCGAGGCGGC TG--GCGACC AGCAGATCAC CGCTGCAGCC AAAXGGATTT  6271
rPN4       CCGAGGCGGC TGCCGGGACC AGCAGATCAC CGCTGCAGCC AAATGGATTT  6445

RATNaCh6A  TATTTTTTCA TTTTGTTGAT TCTCAGAAGC AGAAAGCATC ACTTTAAAAG  6321
rPN4       TATTTTTTCA TTTTGTTGAT TCTCAGAAGC AGAAAGCATC ACTTTAAAAG  6495

RATNaCh6A  TTTGTTTGTT CATGCAAACA ATATTTGAAT TCTTACATTA GTTAAGCTAA  6371
rPN4       TTTGTTTGTT CATNCAAACA ATATTTGAAT TCTTACATTA GTTAAGCTAA  6545

RATNaCh6A  GCAGCAAAAA GAAACACACA CGCACACAGA CACACAAAGA CACACACACA  6421
rPN4       GCANCAAAAA                                              6556

RATNaCh6A  TTCAGCCTAT GTCACTAATC GTCTGTTTCT TTAACATAAC AGCATCTTCT  6471
rPN4                                                               6556

RATNaCh6A  CCACACGAGC GGCACGTGGT TTGGAGATGG GTGGGGGAAA ATCAGGGTTT  6521
rPN4                                                               6556

RATNaCh6A  CAGGCTGAGG AGGACTTGCT CAGGCCAATC CCAAATATGT GCTCGTTCAA  6571
rPN4                                                               6556

RATNaCh6A  TGCATAGAAG TGACCTGCAT GATGGCATGC TGTGTTCAGA AGTCATGCAT  6621
rPN4                                                               6556

RATNaCh6A  GAGACCCACA CACCACAAGA CACTAGTACT CCTGTNNCCA TCCACAGGCT  6671
rPN4                                                               6556

RATNaCh6A  CAGCCTGCGG ACAGGACCAG CCCTGCACCG TTCACTGTAT TTGGAGAAAT  6721
rPN4                                                               6556

RATNaCh6A  GGTAAGAGTT CCACACCGGC TGCAGTCCTC TCAGTGTAGG ATTCTTTCGT  6771
rPN4                                                               6556

RATNaCh6A  ACACCTCTGG GTAGGGAGAC ATAATTAACC AATTGACCAC TACCAACAAA  6821
rPN4                                                               6556

RATNaCh6A  ACAAT                                                   6826
rPN4                                                               6556






WO 98/38302 



PCT/EP98/00997 






Fig. 4A: PN4a/PN4/NaCh6 alignment

M--AARLLAP FGPDSFKPFT FES LAN I ERR IAESKLKKP? KADGSHREDD 48 

M--AARLLA? PGPDSFKPFT PESLANIERR IAESKLKKP? KADGSHREDD 43 

h5A MRRSAJRLLA? PGPDSFKPFT PESLANIERR I AE S KLKK? ? KADGSHREDD 5 0 

EDSKPKPNSD LEAGKSLPFI YGD I FQGLVA VPLEDFDPYY LTQ KT FWLN 99 

EDSKPKPNSD LEAGKSLPFI YGDIPQGLVA VPLEDFDPYY LTQKTFWLN 93 

h6A EDSKPKPNSD LEAGKSLPFI YGDIPQGLVA VPLEDFDPYY LTQKTFWLN 100 



RGKTLFRFSA TPALYILSPF NLIRRIAIKI LIKSVFSHII MCTILTNCVF 14 8 

RGKTLFRFSA TPALYILSPF NLIRRIAIKI LIHSVFSMII MCTILTNCVF 148 
RGKTLFRFSA TPALYILSPF NLIRRIAIKI LIHSVFSMII MCTILTNCVF ISO 



L ii^ j [ 

rPN4a MTFSNPPEWS KNVEYTFTG I YTFESLVKI I ARGFCIDGFT FLRD P WNWLD 198 

rPN4 MTFSNPPEWS KNVEYTFTG I YTFESLVKI I ARGFCIDGFT FLRD 0 WNWLD 198 

RATNaChSA MTFSNPPEWS KNVEYTFTG I YTFESLVKI I ARGFCIDGFT FLRDPWNWLD 200 

--IS3---] [ IS4 ] 

rPN4a FSVIMMAYVT EFVDLGNVSA LRTFRVLRAL KTISVIPGLK TIVGALIQSV 248 

rPN4 FSVIMMAYVT EFVDLGNVSA LRTFRVLRAL KTISVIPGLK TIVGALIQSV 24 8 

RATNa Ch6 A FSVIMMAYVT EFVDLGNVSA LRTFRVLRAL KTISVIPGLK TIVGALIQSV 250 

[ IS5--- ] 

rPN4a KKLSDVMILT VFCLSVFALI GLQLFMGNLR NKCWWPINF NESYLENGTR 2 98 

rPN4 KKLSDVMILT VFCLSVFALI GLQLFMGNLR NKCWWPINF NESYLENGTR 2 98 

RATNa Ch a A KKLSDVMILT VFCLSVFALI GLQLFKGNLS KQCWWPINF NESYLENGTR 3 00 

rPN4a GFDWEEYINN KTNFYMVPGM LEPLLCGNSS DAGQCPEGFQ CMKAGRNPNY 348 

rPN4 GFDWEEYINN KTNFYMVPGM LEPLLCGNSS DAGQCPEGFQ CMKAGRNPNY 348 

RATNa Ch 6 A GFDWEEYINN KTNFYMVPGM LEPLLCGNSS DAGQC-EGFQ CSKAGRNPNY 349 



I IS6 

rPN4a GYTS FDTFS W AFLALFRLMT QDYWENLYQL TLRAAGKTYM IFFVLVIFVG 3 98 

rPN4 GYTS FDTFS W AFLALFRLMT QDYWENLYQL TLRAAGKTYM IFFVLVIFVG 3 98 

RATNa Ch 6 A GYTS FDTFS W AFLALFRLMT QDYWENLYQL TLRAAGKTYM IFFVLVIFVG 3 99 

] 

rPN4a S FYLVNL I LA WAMAYE SQN QATLEEAEQK EAEFKAMLEQ LKKQQEEAQA 44 3 

rPN4 S FYLVNL I LA WAMAYEEQN QATLEEAEQK EAEFKAMLEQ LKKQQEEAQA 448 

RATNaChSA SFYPVNLILA WAMAYEEQN QATLEEAEQK EAEFKAMLEQ LKKQQEEAQA 44 9 

rPN4a AAMATS AG TV SEDAIEEEGE DGVGSPRSSS ELSKLSSKSA KE RRNRR KKR 4 93 

rPN4 AAMATS AG TV SEDAIEEEGE DGVGSPRSSS ELSKLSSKSA KE RRNRR KKR. 4 98 

RATNa Ch 6 A AAMAT S AG TV SEDAIEEEGE DGVGSPRSSS ELSKLSSKSA KERPJ7RRKKR 4 99 

r?N4a KQKEVSZGZZ KGDPEKVFKS ESEDGMRRKA FRLPDNRIGR KF5IMNQSLL 548 

rPN4 KQKSLSSGSS KGDPEKVFKS ESEDGMRRKA FRLPDNRIGR KFSIMNQSLL 548 

RATNa Ch. 6 A KQKELSEGEE KGDPEKVFKS ESEYGMRRKA FRLPDNRIGR KFSIMNQSLL 549 

rPN4a SIPGSPFLSR KNSKSSIFSF RGPGRFRDPG SENEFADDEH STVEESEGRR 5 98 

rPN4 SIPGSPFLSR KNSKSSIFSF RGPGRFRDPG SENEFADDEH STVEESSG^R 598 

RATNaChSA SIPGSPFLSR HNSKSSIFSF GDPS-VRDPG SENEFADDEH STVEESEGRR 593 
Fig. 4B: PN4a/PN4/NaCh6 alignment



r?N4a DSLFIPIRAR ERRSSYSGYS GYSQCSRSSR IFPSLRRSVX RNS7VDCNGV 64 8 

r?N4 DSLFIPIRAR ERRSSYSGYS GYSQCSRSSR IFPSLRRSVK RNSTVDCNGV 648 

PATNaChSA DSLFIPIRAR ERRSSYSGYS GYSQCSRSSR ISPACAQR-E ANSTVDCNGV 647 

rPN4a VSLIGPGSHI GRLLPEVKID KAATDSATTS VEIXXXGPGS LLVSMDQLAS 698 

rPN4 VSLIGPGSHI GRLLPE ATTE VEIKKXGPGS LLVSMDQLAS 688 

RATNaChSA VSLIGPGSHI GRLLLR QRLR WKLRRXALDS -FSFYGPTRL 686 

rPN4a YGRKDRINSI MSWTNTLVE ELEESQRKC? PCWYKFANTF LIWECHPYWI 748 

r?N4 YGRKDRINSI MSWTNTLVE ELEESQRKC? PCWYKFANTF LIWECHPYWI 73 8 

RATNaChSA LRTEGQNQQH NERGHKHAS E ELEESQRKC? PCWYKFANTF LIWECHPYWI 736 



[ IIS1 J L 

r?N4a KLKEIVNLIV MDPFVDLAIT ICIVLNTLFM AMEKHPMTPQ FEHVLAVGNL 798 

rPN4 KLKEIVNLIV MDPFVDLAIT ICIVLNTLFM AMEKHPMTPQ FEHVLAVGNL 788 

RATNaChSA KLKEIVNLIV MDPFVDLAIT ICIVLNTLFM AMEKHPMTPQ FEHVLAVGNL 786 

--IIS2-- -] [ — -IIS3 ] 

rPN4a VFTG I FTAEM FLKLIAMDPY YYFQEGWNIF DGFIVSLSLM ELSLADVEGL 848 

rPN4 VFTG I FTAEM FLKLIAMDPY YYFQEGWNIF DGFIVSLSLM ELSLADVEGL 838 

RATNaCh6A VFTG I FTAEM FLKLIAMDPY YYFQEGWNIF DGFIVSLSLM ELSLADVEGL 836 

[-- IXS4 1 C IISS 

rPN4a SVLRSFRLLR VFKLAKSWPT LNMLIKIIGN SVGALGNLTL VLAIIVFIFA 898 

rPN4 SVLRSFRLLR VFKLAKSWPT LNMLIKIIGN SVGALGNLTL VLAIIVFIFA 888 

RATNaCh6A SVLRSFRLLR VFKLAKSWPT LNMLIKIIGN SVGALGNLTL VLAIIVFIFA 8 86 



rPN4a      WGMQLFGKS YXECVCKINQ ECKLPRWHMN DFFHSFLIVF RVLCGEWIET 948

rPN4       WGMQLFGKS YKECVCKINQ ECKLPRWHMN DFFHSFLIVF RVLCGEWIET 938

RATNaCh6A  WGMQLFGKS YKECVCKINQ ECKLPRWHMN DFFHSFLIVF RVLCGEWIET 936

[ IISS ]

rPN4a      MWDCMEVAGQ AMCLIVFMMV MVIGNLWLN LFLALLLSSF SADNLAATDD 998

rPN4       MWDCMEVAGQ AMCLIVFMMV MVIGNLWLN LFLALLLSSF SADNLAATDD 988

RATNaCh6A  MWDCMEVAGQ AMCLIVFMMV MVIGNLWLN LFLALLLSSF SADNLAATDD 986

rPN4a      DGEMNNLQIS VIRIKKGVAW TKVKVHAFMQ AHFKQREADE VKPLDELYEK 1048

rPN4       DGEMNNLQIS VIRIKKGVAW TKVKVHAFMQ AHFKQREADE VKPLDELYEK 1038

RATNaCh6A  DGEMNNLQIS VIRIKKGVAW TKVKVHAFMQ AHFKQREADE VKPLDELYEK 1036

rPN4a      KANCIANHTG VDIHRNGDFQ KNGNGTTSGI GSSVEKYIID EDHMSFINNP 1098

rPN4       KANCIANHTG VDIHRNGDFQ KNGNGTTSGI GSSVEKYIID EDHMSFINNP 1088

RATNaCh6A  KANCIANHTG VDIHRNGDFQ KNGNGTTSGI GSSVEKYIID EDHMSFINNP 1086



rPN4a NLTVRVPIAV GESDFENLNT EDVSSESDPE GSKDKLDDTS SSEGSTIDIK 1148 

rPN4 NLTVRVPIAV GESDFENLNT EDVSSESDPE GSKDKLDDTS SSEGSTIDIK 1138 

RATNaCh6A NLTVRVPIAV GESDFENLNT EDVSSESDPE GSKDKLDDTS SSEGSTIDIK 1136 



rPN4a PEVSEVPVEQ PEEYLDPDAC FTEGCVQRFK CCQVNIEEGL GKSWWILRKT 1198 

rPN4 PEVEEVPVEQ PEEYLDPDAC FTEGCVQRFK CCQVNIEEGL GKSWWILRKT 1188 

RATNaCh6A PEVEEVPVEQ PEEYLDPDAC FTEGCVQRFK CCQVNIEEGL GKSWWILRKT 1186 
Fig. 4C: PN4a/PN4/NaCh6 alignment 



--IIIS1 ] [ -IIIS2 

rPN4a CFLIVEHNWF E'FIIFMILL SSGALAFEDI YIEQRKTIRT ILEYADKVFT 1248 

rPN4 CFLIVEKNWF E'FIIFMILL SSGALAFEDI YIEQRKTIRT ILEYADKVFT 1238 

RATNaCh6A CFLIVEKNWF E'FIIFMILL SSGALAFEDI YIEQRKTIRT ILEYADKVFT 1236 

] [ IIIS3 ] [ 

rPN4a YIFILEMLLK WTAYGFVKFF TNAWCWLDFL IVAVSLVSLI ANALGYSELG 1298 

rPN4 YIFILEMLLK WTAYGFVKFF TNAWCWLDFL IVAVSLVSLI ANALGYSELG 1288 

RATNaCh6A YIFILEMLLK WTTYGFVKFF TNAWCWLDFL IVAVSLVSLI ANALGYSELG 1286 

IIIS4 --] [ IIIS5 

rPN4a AIKSLRTLRA LRPLRALSRF EGMRVVVNAL VGAIPSIMNV LLVCLIFWLI 1348 

rPN4 AIKSLRTLRA LRPLRALSRF EGMRVVVNAL VGAIPSIMNV LLVCLIFWLI 1338 

RATNaCh6A AIKSLRTLRA LRPLRALSRF EGMRVVVNAL VGAIPSIMNV LLVCLIFWLI 1336 

-] 

rPN4a FSIMGVNLFA GKYKYCFNET SEIRFEIDIV NNKTDCEKLM EGNSTEIRWK 1398 

rPN4 FSIMGVNLFA GKYHYCFNET SEIRFEIDIV NNKTDCEKLM EGNSTEIRWK 1388 

RATNaCh6A FSIMGVNLFA GKYHYCFNET SEIRFEIDIV NNKTDCEKLM EGNSTEIRWK 1386 

rPN4a NVKINFDNVG AGYLALLQVA TFKGWMDIMY AAVDSRKPDE QPDYEGNIYM 1448 

rPN4 NVKINFDNVG AGYLALLQVA TFXGWMDIMY AAVDSRKPDE QPDYEGNIYM 1438 

RATNaCh6A NVKINFDNVG AGYLALLQVA TFKGWMDIMY AAVDSRKPDE QPDYEGNIYM 1436 

- IIIS6--- ] 

rPN4a YIYFVIFIIF GSFFTLNLFI GVIIDNFNQQ KKKFGGQDIF MTEEQKKYYN 1498 

rPN4 YIYFVIFIIF GSFFTLNLFI GVIIDNFNQQ KKKFGGQDIF MTEEQKKYYN 1488 

RATNaCh6A YIYFVIFIIF GSFFTLNLFI GVIIDNFNQQ KKKFGGQDIF MTEEQKKYYN 1486 

[ IVS1- ] 

rPN4a AMKKLGSXXP QKPIPRPLNK IQGIVFDFVT QQAFDIVIMM LICLNMVTMM 1548 

rPN4 AMKKLGSKKP QKPIPRPLNK IQGIVFDFVT QQAFDIVIMM LICLNMVTMM 1538 

RATNaCh6A AMKKLGSKKP QKPIPRPLNK IQGIVFDFVT QQAFDIVIMM LICLNMVTMM 1536 

[ IVS2-- ] [ 

rPN4a VETDTQSKQM ENILYWINLV FVIFFTCECV LKMFALRKYY FTIGWNIFDF 1598 

rPN4 VETDTQSKQM ENILYWINLV FVIFFTCECV LKMFALRKYY FTIGWNIFDF 1588 

RATNaCh6A VETDTQSKQM ENILYWINLV FVIFFTCECV LKMFALRKYY FTIGWNIFDF 1586 

--IVS3 ] [ IVS4--- ] 

rPN4a VVVILSIVGM FLADIIEKYF VSPTLFRVIR LARIGRILRL IKGAKGIRTL 1648 

rPN4 VVVILSIVGM FLADIIEKYF VSPTLFRVIR LARIGRILRL IKGAKGIRTL 1638 

RATNaCh6A VVVILSIVGM FLADIIEKYF VSPTLFRVIR LARIGRILRL IKGAKGIRTL 1636 

[--IVS5- 

rPN4a [illegible] 1698 

rPN4 [illegible] 1688 

RATNaCh6A [illegible] 1686 

rPN4a FGNSMICLFQ ITTSAGWDGL LLPILNRPPD CSLDKEHPGS GFKGDCGNPS 1748 

rPN4 FGNSMICLFQ ITTSAGWDGL LLPILNRPPD CSLDKEHPGS GFKGDCGNPS 1738 

RATNaCh6A FGNSMICLFQ ITTSAGWDGL LLPILNRPPD CSLDKEHPGS GFKGDCGNPS 1736 




Fig. 4D: PN4a/PN4/NaCh6 alignment 
rPN4a VGIFFFVSYI IISFLIVVNM YIAIILEMFS VATSESADPL SEDDFETFYS 1798 

rPN4 VGIFFFVSYI IISFLIVVNM YIAIILSNFS VATEESADPL SEDDFETFYS 1788 

RATNaCh6A VGIFFFVSYI IISFLIVVNM CIAIILENFS VATEESADPL SEDDFETFYS 1786 

rPN4a IWEKFDPDAT QFISYCKLAD FADALSHPLR VPKPNTIELI AMDLPMVSGD 1848 

rPN4 IWEKFDPDAT QFISYCKLAD FADALSHPLR VPKPNTIELI AMDLPMVSGD 1838 

RATNaCh6A IWEKFDPDAT QFIEYCKLAD FADALSHPLR VPKPNTIELI AMDLPMVSGD 1836 
rPN4 RIHCLDILFA FTKRVLGDSG ELDILRQQME ERFVASNPSK VSYEPITTTL 1888 

RATNaCh6A RIHCLDILFA FTKAVLGDSG ELDILRQQME ERFVASNPSK VSYEAYHTTL 1886 

rPN4a RRKQEEVSAV VLQRAYRGHL ARRGFICRKM ASNKLSNGGT HRDKKSSTPS 1948 

rPN4 RRKQEEVSAV VLQRAYRGHL ARRGFICRKM ASNKLSNGGT HRDKKSSTPS 1938 

RATNaCh6A RRNEESVSAV VLQRAYRGHL ARRGFICRKM ASNKLSNGGT HRDKKESTPS 1936 



rPN4a TASLPSYDSV TKPDKSKQQR ASSGRRSRAK RQKEVRESKC 1988 

rPN4 TASLPSYDSV TKPDKSKQQR ASSGRRSRAK RQKEVRESKC 1978 

RATNaCh6A TASLPSYDSV TKPDKSKQQR ASEGRRSRAK RQKEVRESKC 1976 
Fig. 5: PN4a/PN4/NaCh6/BrainII Interdomain I/II 